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5 60/1 5 1 ,440 filed August 30, 1 999. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

OR DEVELOPMENT 
The research project which gave rise to the invention described in this patent 
application was supported by EPA grant R824726-01 . The United States Government 
10 may have certain rights in this invention. 



BACKGROUND OF THE INVENTION 
The technology of genetic engineering allows the transfer of genetic traits 
between species and permits, in particular, the transfer of enzymes from one species to 
others. These techniques have first reached commercialization in connection with high- 
15 value added products such as pharmaceuticals. The techniques of genetic engineering 
are equally applicable and cost effective when applied to genes and enzymes which can 
be used to make basic chemical feedstocks. 

A metabolic pathway of interest exists in the bacteria Klebsiella pneumoniae, 
which has the ability to biologically produce 3 - hydroxypropionaldehyde from glycerol. 
20 Native microorganisms have the ability to produce 1,3 - propanediol from glycerol as 
well. Commercial interests are exploring the production of 1 ,3 - propanediol from 
glycerol or glucose, in recombinant organisms which have been engineered to express 
the enzymes necessary for 1,3- propanediol production from other organisms. 

3 - hydroxypropionic acid CAS registry Number [503-66-2] (abbreviated as 3- 
25 HP) is a three carbon non-chiral organic molecule. The IUP AC nomenclature name for 



-1- 



P C T , " U S Q O 2 3 C 7' S - 



this molecule is propionic acid 3 - hydroxy. It is also known as 3 - hydroxypropionate, 
P - hydroxpropionic acid, p - hydroxypropionate, 3 - hydroxypropionic acid, 3 - 
hydroxypropanoate, hydracrylic acid, ethylene lactic acid, p -lactic acid and 2 - 
deoxyglyceric acid. Applications of 3-HP include the manufacture of absorbable 
5 prosthetic devices and surgical sutures, incorporation into beta-lactams, production of 
acrylic acid, formation of trifluromethylated alcohols or diols, polyhydroxyalkonates, 
and co-polymers with lactic acid. 3-HP for commercial use is now commonly produced 
by organic chemical syntheses. The 3-HP produced and sold by these methods is 
relatively expensive, and it would be cost prohibitive to use it for the production of 
10 monomers for polymer production. As discussed below, some organisms are known to 
produce 3-HP. However, there is not yet available a catalog of genes from these 
organisms and thus the ability to synthesize 3-HP using the enzymes natively 
responsible for the synthesis of that molecule in the native hosts which produce it does 
not now exist. 

15 In addition to its commercial utility, 3-HP it is found in a number of biological 

processes, notably including many naturally occurring bio-polymers. Poly(3 - 
hydroxybutyrate) (PHB) is the most abundant member of the microbial polyesters which 
contain hydroxy monomers termed polyhydroxyalkonates (PHAs). PHB has utility as a 
biodegradable thermoplastic material and the material was first produced industrially in 

20 1982. 

The majority of published research on PHA's that contain 3-HP has concentrated 
on two bacterial sources: Ralstonia eutropha ("Alcaligenes eutrophus") and 
Pseudomonas oleovorans. Both Ralstonia eutropha and Pseudomonas oleovorans are 
able to grow on a nitrogen free media containing 3 - hydroxy - propionic acid, 1,5 - 

25 pentanediol or 1,7 - heptanediol. When 3-HP is the major hydroxy-acid added to the 
growth media, poly(3 - hydroxybutyrate - co - 3 - hydroxypropionic acid) is formed 
containing 7 mol % 3 - hydroxypropionic acid. These cells also store 3 mol %, 3 - 
hydroxypropionic acid poly(3 - butyrate - co - 3 - hydroxypropionic acid). 

Recombinant systems have been used to create PHAs. An E, coli strain 

30 engineered to express PHA synthase from either Ralstonia eutropha or Zoolgoea 
ramigera produced poly(3 - hydroxypropionic acid) when feed 1,3 - propanediol. 
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Skraly, F. A. "Polyhydroxyalkanoates Produced by Recombinant E. colL" Poster at 
Engineering Foundati on Conference: Metabolic Engineering II. 1998. An E. coli strain 
that expressed PHA synthase (MBX820), when provided with the genes encoding 
glycerol dehydratase and 1,3 - propanediol dehydratase from K pneumonia , and 4 - 
5 hydroxybutyral- CoA transferase from Clostridium kluyveri, synthesized PHB from 
glucose. 

Glycerol dehydratase, found in the bacterial pathway for the conversion of 
glycerol to 1,3 - propanediol, catalyzes the conversion of glycerol to 3 - 
hydroxypropionaldehyde and water. This enzyme has been found in a number of 

10 bacteria including strains of Citrobacter, Klebsiella, Lactobacillus, Entrobacter and 
Clostridium. In the 1,3 - propanediol pathway a second enzyme 1,3 - propanediol 
oxido-reductase (EC 1.1.202) reduces 3 - hydroxypropanaldehyde to 1,3 - 
propanediol in a NADH dependant reaction. The pathway for the conversion of 
glycerol to 1,3 - propanediol has been expressed in E. coli. Tong et aL, A pplied and 

15 Environmental Microbiology 57 (12) 3541-3546. The genes responsible for the 
production of 1,3 - propanediol were cloned from the dha regulon of Klebsiella 
pneumoniae. Glycerol is transported into the cell by the glycerol facilitator, and then 
converted into 3 - hydroxy - propionaldehyde by a coenzyme B 12 - dependent 
dehydratase. E. coli lacks a native dha regulon, consequently E. coli cannot grow 
20 anaerobically on glycerol without an exogenous electron acceptor such as nitrate or 
fumarate. 

Aldehyde dehydrogenases are enzymes that catalyze the oxidation of aldehydes 
to carboxylic acids. The genes encoding non-specific aldehyde dehydrogenases have 
been identified in a wide variety of organisms e.g.; ALDH2 from Homo sapiens , ALD4 
25 from Saccharomyces cerevisiae, and from E. coli both aldA and aldB, to name a few. 
These enzymes are classified by co-factor usage, most require either AND + , or NADP + 
and some will use either co-factor. The genes singled out for mention here are able to 
act on a number of different aldehydes and it likely that they may be able to oxidize 3 - 
hydroxy - propionaldehyde to 3 - hydroxypropionic acid. 
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BRIEF SUMMARY OF THE INVENTION 
The present invention is intended to permit the creation of a recombinant 
microbial host which is capable of synthesizing 3 -HP from a starting material of 
glycerol or glucose. The glycerol or glucose is converted to 3 - 
5 hydroxypropionicaldehyde (abbreviated as 3 -HP A) which is then converted to 3-HP. 
This process requires the so-called dhaB gene from Klebsiella pneumoniae which 
encodes the enzyme glycerol dehydratase any one of four different aldehyde 
dehydrogenase genes to convert 3-HP A to 3-HP. The four aldehyde dehydrogenase 
genes used were aldA from the bacterium E. coli, ALDH2 from humans, ALD4 from the 
10 yeast Saccharomyces cerevisiae, and aldB from E. coli. The yeast gene appeared to give 
the best results. 

It is an object of the present invention to provide a genetic construct which 
encodes glycerol dehydratase and aldehyde dehydrogenase enzymes necessary for the 
production of 3 - hydroxypropionic acid from glycerol. 
15 It is also an object of the present invention to provide a method for the 

production of 3 - hydroxypropionic acid from glycerol. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiment thereof and from the claims. 



20 BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

Not applicable. 

DETAILED DESCRIPTION OF THE INVENTION 
It is disclosed here that it is possible to introduce into a bacterial host genes 
encoding two enzymes and thus confer upon that host the ability to produce 3-HP from 
25 glycerol. The two necessary enzymes are glycerol dehydratase and aldehyde 
dehydrogenase. It is here reported that the two enzymes are both necessary and 
sufficient to enable a strain of a suitable host, such as a competent E. coli strain, to make 
3-HP from glycerol. An exemplary gene encoding a glycerol dehydratase is known, the 
dhaB gene from Klebsiella pneumoniae, sequenced and rendered convenient to use. 
30 Several exemplary aldehyde dehydrogenases are known, and their sequences are 
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presented here. From this information, it becomes practical to confer upon a bacterial 
host the ability to convert glycerol into 3-HP in a commercially reasonable manner. 

It was not apparent before the completion of the work described here that these 
two diverse enzymes could be produced in a common host to produce the ability to 
5 make 3-HP. There are many known aldehyde dehydrogenase enzymes and genes, and 
the enzymes are known to have varying substrate specificities and efficiencies. There 
was not evidence, prior to the work described here, that the aldehyde dehydrogenase 
enzyme would work on the 3-hydroxypropionicaldehyde (3-HPA) substrate to create 3- 
HP. Without that knowledge, there was no data from which to predict the effectiveness 
10 of the 3-HP production studies described below. An additional uncertainty arises from 
the fact that the intermediate aldehyde, 3-HPA, is toxic to many bacterial host and thus 
the survival of the host is dependent upon the relative rates of enzymatic production and 
conversion of the aldehyde intermediate to non-toxic 3-HP. 

A difficulty in the realization of the production of 3-HP desired here is that 
1 5 ribosome binding sites from non-native hosts are often ineffectual and lead to poor 

protein production and that many non-native promoters are often poorly transcribed and 
a bar to high protein expression. However, the inventors also recognized that a non- 
native promoter that is known to be very active and is inducible by the addition of a 
small molecule unrelated to the pathway being expressed is often a very efficient way to 
20 express and regulate the levels of enzymes expressed in hosts such as E. coli. To 
achieve high levels of regulated gene expression plasmids were constructed which 
placed the expression of all exogenous genes necessary for the production of 3 - 
hydroxypropionic acid from glycerol under the regulation of the trc promoter. The trc 
promoter, is efficient, not native to E. coli, and inducible by the addition of IPTG. 
25 The present specification describes a genetic construct for use in the production 

of 3 - hydroxypropionic acid from glycerol. The genetic construct includes exemplary 
DNA sequences coding for the expression of a glycerol dehydratase and a DNA 
sequence coding for aldehyde dehydrogenase. The set of exemplary sequences 
necessary for the expression of glycerol dehydratase is collectively referred to as 
30 "dhaB". The set of sequences necessary for the expression of aldehyde dehydrogenase 
includes any one of four different genes which proved efficacious. The individual 
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aldehyde dehydrogenase sequences referred to individually as ALDH4, ALD2, aldA and 
aldB. 

Producing 3 - hvdroxvpropionic acid in a foreign host 

In the work described below, the enzymes necessary for the production of 3 - 
5 hydroxypropionic acid from glycerol in E. coli were expressed under the regulation of 
the trc promoter, a non-native promoter inducible by the addition of IPTG. The 
glycerol dehydratase was encoded by the dhaB gene from Klebsiella pneumoniae, the 
aldehyde dehydrogenases used was any one of four different genes (ALDH2 from Homo 
sapiens, ALD4 from S. cerevisiae, aldB from E. coli or aldA from E. coli). Expression 

10 of these genes coding for glycerol dehydratase and any one of the genes encoding an 
aldehyde dehydrogenases was sufficient to enable the construct to produce 3-HP when 
the fermentation media was supplemented with glycerol. In all of these constructs, the 
dhaB gene was downstream from the gene encoding the aldehyde dehydrogenase used, 
and expression of both genes was regulated by the trc promoter. This order, however, is 

15 not required and the order of the gens on a construct and the use of multiple constructs is 
possible. 

In a minimal genetic construct made based on the data presented here, the only 
genetic elements present that would be necessary are the structural genes dhaB and an 
aldehyde dehydrogenase gene encoding a protein that efficiently catalyzes the oxidation 

20 of 3-hydroxypropionaldehyde to 3-hydroxypropionic acid, and non-native promoter 

sequences specifically selected to give the type of inducible control most appropriate for 
the context of the process in which the construct is to be used. Extraneous pieces of 
DNA, whether retained in the construct or added from other DNA sequences, would not 
necessarily be detrimental to effective 3-HP synthesis by the host organism, but would 

25 not be needed. Each sequence to be translated would necessarily be preceded by a 
ribosome binding site, functional in the selected host so that the messenger RNA(s) 
coding for the proteins of interest could be translated by ribosomes. Terminator 
sequences immediately downstream of each translated unit would also be necessary in 
some organisms, particularly in eukaryotes. The construct could be part of an 

30 autonomously replicating sequence, such as a plasmid or phage vector, or could be 
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integrated into the genome of the host. 

The structural genes and appropriate promoters) could be isolated by the use of 
restriction enzymes, by the polymerase chain reaction (PCR), by chemical synthesis of 
the appropriate oligonucleotides, or by other methods apparent to those skilled in the art 
5 or molecular biology. The promoters) would be derived from genomic DNA of other 
organism or from artificial genetic constructs containing promoters. Appropriate 
promoter fragments would be ligated into the construct upstream of the structural genes 
in any one of several possible arrangements. 

The aldehyde dehydrogenase expressed would have: high specific activity 

10 towards 3-hydroxypropionaldehyde; be very stable in the host it is expressed in; be 
readily over expressed in the selected host; not be inhibited by either the substrates 
necessary for the reaction or the products formed by the reaction; be fully active under 
the fermentation conditions most favorable for the production of 3 - hydroxypropionic 
acid and be able to use either NAD + or NADP + . 

15 One possible arrangement is the true operon, where one promoter is used to 

direct transcription in one direction of all necessary Open Reading Frames (ORFs). The 
entire message is then contained in one messenger RNA. The advantages of the operon 
are that it is relatively easy to construct, since only one promoter is needed; that is it is 
relatively simple to replace the promoter with another promoter if that would be 

20 desirable later; and that it assures that the two genes are under the same regulation. The 
main disadvantage of the operon scheme is that the levels of the expression of the two 
genes cannot be varied independently. If it is found that the genes, for optimal 3 - 
hydroxypropionic acid synthesis, should be expressed at different levels, the operon in 
most cases cannot be used to realize this. 

25 Another possible arrangement is the multiple-promoter scheme. Two or more 

promoters, with the same or distinct regulatory behavior, could be used to direct 
transcription of the genes. For example, one promoter could be used to direct 
transcription of dhaB and one to direct transcription of the gene encoding the 
appropriate aldehyde dehydrogenases. Because the genes theoretically can be 

30 transcribed and translated separately, a great number of combinations of multiple 

promoters is possible. Additionally, it would be most desirable to prevent the promoters 
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from interfering with one another. This could be achieved either by placing two 
promoters into the construct such that they direct transcription in opposite directions, or 
by inserting transcriptional terminator sequences downstream of each separately 
transcribed unit. The main advantage of the multiple-promoter construct is that it 
5 permits independent regulation of as many distinct units as desired, which could be 
important. The disadvantages are that it would be more difficult to construct; more 
difficult to amend later; and more difficult to effectively regulate, since multiple 
changes in fermentation conditions would need to be introduced and might render the 
performance of the fermentation somewhat less predicable. 

10 In any construct, the promoter sequence(s) used should be functional in the 

selected host organism and preferably provide sufficient transcription of the genes 
comprising the glycerol to 3 - hydroxypropionic acid pathway to enable the construct to 
be adequately active in that host. The promoter sequence(s) used would also effect 
regulation of transcription of the genes enabling the glycerol to 3 -HP pathway to be 

15 adequately active under the fermentation conditions employed for 3-HP production, and 
preferably they would be inducible, such that expression of the genes could be 
modulated by the inclusion in, or exclusion from, the fermentation of a certain agents or 
conditions. 

A plausible example of the use of such a construct follows: one promoter, which 
20 induced by the addition of an inexpensive chemical (the inducer) to the medium, could 
control transcription of both the dhaB gene and the gene encoding the appropriate 
aldehyde dehydrogenase. The cells would be permitted to grow in the absence of the 
inducer until they accumulated to a predetermined level. The inducer would then be 
added to the fermentation and nutritional changes commensurate with the altered 
25 metabolism would be made to the medium as well. The cells would then be permitted to 
utilize the substrate(s) provided for 3-HP production (and additional biomass production 
if desired). After the cells could no longer use substrate to produce 3-HP, the 
fermentation would be stopped and the 3-HP recovered. 

Genetic Sequences 

30 To express glycerol dehydratase and a suitable aldehyde dehydrogenase, the two 
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enzymes necessary for the production of 3 - hydroxypropionic acid from glycerol, it is 
required that the DNA sequences containing the glycerol dehydratase and aldehyde 
dehydrogenase coding sequences be combined with at least a promoter sequence 
(preferably a non-native promoter although some native promoter activity may be 
5 present). An exemplary method of construction is described in the example below. To 
ensure that the present specification is enabling, the full sequences of the coding regions 
of genes for these enzymes is presented here. 

Sequences 1, 3, 5 and 7 present different native genomic sequences for genes 
encoding aldehyde dehydrogenases. 
10 SEQ ID NO:l contains the full native DNA sequence encoding the ALD4 

enzyme from Saccharomyces cerevisiae. The amino acid sequence of the protein is 
(jy presented as SEQ ID NO:2. 

SEQ ID NO:3 includes the DNA sequence for the human ALDH2 gene, again 
including the full protein coding region. The amino acid sequence for this human 
15 alcohol dehydrogenase is presented in SEQ ID NO:4. 

SEQ ID NO:5 and 7 respectively present the full coding sequences from the E. 
coli genes aldA and aldB, both of which encode alcohol dehydrogenases. The amino 
acid sequences for the proteins encoded by the genes are presented in SEQ ID NO: 6 
and 8 respectively. 

20 SEQ ID NO:9 contains the native genomic DNA sequence for the dhaB gene 

£2 A from the dha regulon of Klebisiella pneumoniae. The coding sequences for this 

complex regulon produces five polypeptides, which are presented as SEQ ID NOS: 10 
through 13, which together provide the activity of the glycerol dehydratase enzyme. 

Each of these coding sequences can be used to make genetic constructs for the 

25 expression of the appropriate enzymes in a heterologous hosts. In making genetic 

constructs for expression of the genes in such hosts, it is contemplated that heterologous 
promoters will be joined to the coding sequences for the enzymes, but all that it required 
is that the promoters be effective for the hosts in which the genes are to be expressed. It 
is also contemplated and envisioned that significant variations in DNA sequence are 

30 possible from the native DNA coding sequences presented here. As is well known in 
the art, due to the degeneracy of the genetic code, many different DNA sequences can 
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encode the expression of the same protein. So, when this document uses language 
specifying a DNA sequence encoding a protein, it is intended to encompass any DNA 
sequence which can be used to express that protein even if different from the genomic 
sequences presented here. It is also contemplated that conservative changes in the 
5 amino acid sequences of the proteins specified here can be made without departing from 
the present invention. In particular, deletions, additions and substitutions of one or more 
amino acids in a protein sequence can almost always be made without changing protein 
functionality. When the name of a protein is sued here, it is intended to be equally 
applicable to both such minor changes in amino acid sequence and to allelic variations 
10 in native protein sequence as occurs within the species named as well as other closely 
related species. 

It is possible that many of the above DNA sequences could be truncated and still 
express a protein that has the same enzymatic properties. One skilled in the art of 
molecular biology would appreciate that minor deletions, additions and mutations may 

15 not change the attributes of the designated base pair sequences; many of the nucleotide 
of the designated base pair sequences are probably not essential for their unique 
function. To determine whether or not an altered sequence or sequences has sufficient 
homology with the designated base pairs to function identically, one would simply 
create the candidate mutation, deletion or alteration and create a gene construct 

20 including the altered sequence together with promoter and termination sequences. This 
gene construct could be tested as, described below, for the production of 3-HP from 
glycerol. 

Certain DNA primers were used to isolate or clone the genomic DNA sequences 
used in the experiments described below. While the sequence information presented 
25 here is sufficient to enable the construction of expression plasmids incorporating the 
genes identified here, in order to redundantly enable the use of these genes, primers 
which may be used to isolated the genes from their native hosts are described below. 

The primers aldA_L (SEQ ID NO: 14), and alcA_R (SEQ ID NO: 15), were used 
to amplify the 1513 bp aldA fragment from genomic E. coli DNA (strain MG1655, a 
30 gift from the Genetic Stock Center, New Haven, CT). The gel purified PCR fragment 
containing a DNA sequence coding for the expression of aldehyde dehydrogenase was 
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inserted into Ncol-Xhol site of pSE380 (Invitrogen, San Diego, CA) to give pPFS3. The 
resulting plasmid contained aldA under the control of the trc promoter. This construct 
allowed for high-level expression of the aldA gene from E. coli under regulation of the 
trc promoter. Unless indicated otherwise all molecular biology and plasmid 
5 constructions were done in E. coli AG1 (Stratagene, La Jolla, CA). 

The primers aldB_L (SEQ ID NO:20) and aldB_R (SEQ ID NO:21), were used 
to amplify the 1574 bp aldB fragment from genomic E. coli DNA (strain MG1655). 
The resulting PCR converted the TGA stop codon into a TAA stop codon. The gel- 
purified PCR fragment containing the DNA sequence sufficiently coding for the 
10 expression of aldehyde dehydrogenase was inserted into the Kpnl-Sacl site of pSE380 to 
givepPFS12. 

The primers ALD4_L (SEQ ID NO : 16), and ALD4_R (SEQ ID NO : 1 7), were 
used to amplify the 1595 bp ALD4 fragment from S, cerevisiae DNA (strain YPH500). 
The gel-purified fragment containing a DNA sequence coding for the expression of 

15 aldehyde dehydrogenase was inserted into the Kpnl-Sacl site of pPFS3 to give pPFS8. 
The resulting plasmid contained mature ALD4 under control of the trc promoter. 

The primers ALDH2_L (SEQ ID NO:18), and ALDH2_R (SEQ ID NO:19), 
were used to amplify the 1541 bp ALDH2 fragment from pT7-7::ALDH2, a gift from 
H. Weiner (Purdue University, West Lafayette, IN). The gel purified PCR fragment 

20 containing a DNA sequence sufficiently homologous to base pairs 22 to 1524, inclusive 
of SEQ ID NO : 3 so as to code for the expression of aldehyde dehydrogenase was 
inserted in to the Kpnl-Sacl site of pSE380 to give pPFS7. This sequence was moved 
from pPFS7 into the Kpnl-Sacl site of pPFS3 to give pPFS9. The resulting plasmid 
contained mature ALDH2 under the control of the trc promoter. 

25 The primers pTRC_L (SEQ ID NO:22), and pTRC_R )SEQ ID NO:23), were 

used to amplify the 540 bp fragment from pSE380. The gel purified PCR fragment was 
inserted into the Hpal-Kpnl site of pPFS3 to give pPFS13. The resulting plasmid 
deleted the "native" ribosome binding site of pSE380 and a Ncol site (which contained 
an extraneous ATG start codon upstream of the cloned genes). The Kpnl-Sacl 

30 fragments of pPFS8, pPFS9, and pPSF12 were inserted into the Kpnl-Sacl site of 
pPFS13 to give pPFS14, pPFS15, and pPFS16, respectively. 
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Assay for production of 3-HP 

The efficacy of changes made as contemplated herein can be checked by the 
following tests. To test for the production of 3 -HP, fermentation products can be 
quantified with a Waters Alliance Integrity HPLC system (Milford, MA) equipped with 
5 a refractive index detector, a photodiode array detector, and an Aminex HPX-87H (Bio- 
Rad, Hercules, CA) organic acids column. The mobile phase should be 0.01 N sulfuric 
acid solution (pH 2.0) at a flow rate of 0,5 mL/min. The column temperature should be 
set to 40 °C. Compounds can be identified by determining if they co-elute with 
authentic standards. Prior to analysis, all samples should be filtered through 0.45 |xM 
10 pore size membrane. (Gelman Sciences, Ann Arbor, MI). The fractions of the 

fermentation products collected using HPLC should be analyzed on a Varian Star 3400 
CX, gas - chromatograph coupled to a Varian Saturn 3 mass spectrometer (GC-MS) 
(Walnut Creek, CA). 

Assay for enzyme activity. 

1 5 Aldehyde dehydrogenase activity can be determined by measuring the reduction 

of P-NAD + at 25 °C with 3 - hydroxypropionaldehyde as a substrate. All buffers should 
contain 1 mM ethylenediaminetetraacetic acid (EDTA), 0. 1 mM Pefabloc SC 
(Boehringer Mannheim, Indianapolis, IN) and 1 mM Tris (carboxyethyl) phosphine 
hydrochloride (TCEP-HCL). For ALD4, the solution should contain 100 mM Tris HCL 

20 Buffer (pH 8.0), 100 mM KC1. For ALDH2 the solution should contained 100 mM 
sodium pyrophosphate (pH 9.0). For AldA and AldB, the solution should contain 20 
mM sodium glycine (pH 9.5). A total of 3.0 mL of buffer should be added to quartz 
cuvettes and allowed to equilibrate to assay temperature. From 5 to 20 jiL of cell extract 
should be added and background activity recorded after the addition of p-NAD + to a 

25 final concentration of 0.67 mM. The reaction should be started by the addition of 
substrate (either acetaldehyde, propionaldehyde, or 3 - hydroxypropionaldehyde) to a 
final concentration of 2 mM. Assay mixtures should be stirred with micro-stirrers 
during the assays. 

For aldehyde dehydrogenase activity assays, one unit is defined as the reduction 
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of 1 .0 nM of p-AND + per minute at 25 ° C. These reactions can be monitored by 
following the change in absorbence at 340 nm (A 340 ) at 25 °C on a Varain Carry- 1 Bio 
spectrophotometer (Sugar Land, TX). Total protein concentrations in the cell extracts 
can be determined using the Bradford assay method (Bio-Rad, Hercules, CA) with 
5 bovine serum albumin as the standard. 

EXAMPLES 

Plasmid constructions. 

Klebsiella pneumoniae expresses glycerol dehydratase, an enzyme that catalyzes 
the conversion of glycerol to 3 - hydroxypropionaldehyde, (dhaB) and 1,3 - 
10 propanediol oxidoreductase an enzyme that catalyzes the conversion of 3 - 

hydroxypropionaldehyde to 1,3 - propanediol respectively (the gene product from 
dhaT). A plasmid encoding these two genes was created and expressed in E. coli 
(plasmid pTC53). The dhaT gene was deleted from pTC53 to create pMH34. The 
resulting plasmid still contained the DNA sequence complementary to base pairs 330 to 
15 2153 inclusion of SEQ ID NO : 9, the complement of base pairs 2166 to 2591, 

inclusive, of SEQ ID NO : 9, and the complement of base pairs 3191 to 4858, inclusive, 
of SEQ ID NO : 9, so as to code for the expression of glycerol dehydratase. The 
fragment of DNA encoding these sequences was excised from pMH34 by cutting it with 
Sall-Xbal, and the resulting fragment was gel purified (the purified fragment was gift 
20 fromM. Hoffman of the University of Wisconsin -Madison). This DNA fragment was 
inserted into the SaR-Xbal site of pPFS13 to give pPFS17. 

The resulting plasmid contained both the aldA and dhaB genes under the control 
of the trc promoter. Similarity, the gel-purified Sall-Xbal fragment from pMH34 was 
inserted into the SaR-Xbal sites of pPFSH, pPFSIS, and pPFS16 to give pPFS18, 
25 pPFS19, and pPFS20, respectively. These plasmids contained ALD4, ALDH2, and 
aldB, respectively, as well as dhaB under the control of the trc promoter; in all of the 
constructs the dhaB gene were downstream of the gene encoding the aldehyde 
dehydrogenase. 
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Expression in E. coli. 

The efficacy of E. coli as a platform for the production of 3-HP from growth on 
glucose has been examined using a mathematical model developed for this purpose. The 
model was executed in two different ways assuming the conversion of one mole of 
5 glucose under either anaerobic or aerobic conditions either directly to 3-HP or to the 
production of 3-HP and ATP. The optimum yield under anaerobic conditions is 1 mole 
of 3-HP and 1 mole of lactate. The more realistic yield under anaerobic conditions is 
0.5 moles of 3-HP, 1 .5 moles of lactate and 1 mole of ATP. The optimum yield under 
aerobic conditions is 1 .9 moles of 3-HP and 0.3 moles of C0 2 . The realistic yield under 

10 aerobic conditions is 1.85 moles of 3-HP, 0,35 moles of C0 2 and 1 mole of ATP. 

The effect of 3-HP concentration on E. coli strain MG1655 growth was 
measured. Cells were grown on standard media with and without the addition of up to 
80g/L of 3-HP. The best fit of these data demonstrated that 3-HP was only 1 A times as 
inhibitory as lactic acid on the growth of E. coli. It is possible to economically produce 

15 lactic acid using E. coli, since 3-HP is only 1.4 times more inhibitory than lactic acid, 
it should be possible to use E. coli as a host for the commercial production of 3-HP. 

Media and growth conditions 

The standard media contained the following per liter: 6 g Na 2 HP0 4 , 3 g KH 2 P0 4 , 
1 g NH 4 C1, 0.5 g NaCl, 3 mg CaCl 2 , 5 g yeast extract (Difco Laboratories, Detroit, MI) 

20 and 2 mM MgS0 4 . When necessary to retain plasmids ampicillin (100 mg/mL) was 
added to the media. Isopropyl-P-thiogalactopyranoside (IPTG) was added in varying 
amounts to induce gene expression. AH fermentations were carried out in an incubator- 
shaker at 37 C and 200 rpm. Anaerobic fermentations were carried out in 500-mL 
anaerobic flasks with 300 mL of working volume. Inocula for fermentations were 

25 grown overnight in Luria-Bertani medium supplemented with ampicillin is necessary. 
The 300-mL fermentations were inoculated with 1.5 mL of the overnight culture. For 
en2yme assays, fermentations were incubated for 24 hours. 

Over expression of aldehyde dehydrogenase in E. coli. 

Cells were harvested by centrifugation at 3000 x g for 10 minutes at 4°C with a 
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Beckman (Fullerton, CA) model J2-21 centrifuge. Cell pellets were washed twice in 
100 mM potassium phosphate buffer at pH 7.2 and re-suspended in appropriate assay 
resuspension buffer equal to 5 x of the volume of the wet cell mass. The cells were 
homogenized using a French pressure cell. The homogenate was centrifuged at 40000 x 
5 g for 30 minutes. The supernatant was dialyzed against the appropriate resuspension 
buffer using 10000 molecular weight cut-off pleated dialysis tubing (Pierce, Rockford, 
IL) at 4°C. Dialysis buffer was changed after 2 hours, and 4 hours, and dialysis was 
stopped after being allowed to proceed overnight. 

E. coli AG1 cells transfected with the plasmids constructed to express the aldA, 

10 ALD4 y ALDH2, or aldB genes were grown in 500-mL anaerobic flasks. Twelve hours 
after the fermentations were inoculated IPTG was added to induce enzyme expression. 
The cells were allowed to grow for an additional 12 hours then harvested and lysed as 
discussed above. The soluble fraction of the lysate was assayed for aldehyde 
dehydrogenase activity using the substrate 3-hydroxypropionicaldehyde in the buffer 

15 appropriate for the particular enzyme expressed The plasmid, aldehyde dehydrogenase 
expressed and specific activity measured (U/mg of protein) were as follows: pPFS13, 
aldA, 0.2; pPFS14, ALD4 9 0.5, pPFSIS, ALDH2, 0.3; and pPFS16, aldB. 0.1. The 
control, E. coli strain AG1 harboring plasmid pSE380, encoded no exogenous aldehyde 
dehydrogenase activity and it had no detectable activity with 3-HP as substrate. It is 

20 clear from the activity assays that all four aldehyde dehydrogenases were expressed in 
E. coli. The aldehyde dehydrogenase cloned from Saccharomyces cerevisiae (ADH4) 
had the highest activity when 3-hydroxypropionaldehyde was used as the substrate (0.5 
units/mg of protein). 

E. coli cells transformed with plasmids expressing: aldehyde dehydrogenase; 

25 both aldehyde dehydrogenase and glycerol dehydratase, or neither gene; were grown 
and assayed for their ability to produce 3-HP from glycerol. The cells were grown on 
standard media supplemented with 6 jiM of Coenzyme B 12 , under anaerobic conditions 
in the absence of light (to protect the integrity of the Coenzyme B 12 necessary for DhaB 
activity). After 12 hours, IPTG was added to induce expression of the genes under the 

30 trc promoter at the same time 5g/L of glycerol was added. After 12 more hours of 

anaerobic fermentation the fermentation broth was assayed for 3 - HP by HPLC and GC, 
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the plasmid, aldehyde dehydrogenase gene expressed and g/L of 3- HP measured were 
as follows: pSF17, aldA, 0.031; pPSF 18 ALD4 9 0.173; and pPSFl9,ALDH2, 0.061. 
Cells expressing dhaB but no exogenous aldehyde dehydrogenase genes (plasmid 
pMH34) produced 0.0 1 5 g/L of 3 - HP. Cells expressing aldA, ALD4, ALDH2 or aldB 
5 but not dhaB (plasmids pPFS13, pPFS14, pPFSIS, pPFS16, respectively) all produced 
less then 0.005 g/L of 3-HP when the media the cells were growing in was 
supplemented with 2.5g/L of 3-hydroxypropionaldehyde. 

Other Hosts and Promoters 

Applications of the 3 - hydroxypropionic acid pathway such as the genetic 

10 constructs of the present invention can easily be expressed in other organisms. The 
required genes would need to be placed under control of an appropriate promoter or 
promoters. Some organism such as yeasts may require transcription terminators to be 
placed after each transcribed unit. The knowledge of the present intention makes such 
amendments possible. Such a genetic construct would need to be part of a vector that 

15 could either replicate in the new host or integrate into the chromosome of the new host. 
Many such vectors are commercially available for expression in gram-negative and 
gram-positive bacteria, yeast, mammalian cells, insect cell, plant, etc. For example, to 
express the 3 -hydroxypropionic acid pathway in Rhodobacter capsulatus, one could 
obtain vector pNH2 from the American Type Culture Collection ( ATTC). This is a 

20 shuttle vector for use in R. capsulatus and E. coli. Organisms such as Saccharomyces 
cerevisiae which can convert glucose to glycerol could be used as a host, such a 
construct would enable the production of 3 - HP directly from glucose. Additionally, 
other substrates such as xylan could also be used given the selection of an appropriate 
host. 

25 Stochiometric analysis shows that best stochiometric yield of 3-HP production in 

E. coli calculated on the basis of glucose consumed is obtained under aerobic 
conditions. Under aerobic condition C0 2 is the only carbon-containing co-product, in 
particular the generation of lactic acid which occurs under anaerobic conditions is 
avoided. Production of 3-HP under these conditions could result in a more economical 

30 recovery of 3-HP from the fermentation broth. 
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Alternatively, the <i/*a£ gene and a gene encoding the appropriate aldehyde 
dehydrogenase could be cloned into the multiple cloning site of this vector in E. coli to 
facilitate construction, and then transformed into R. capsulatus. The R. capsulatus nifH 
promoter, provided on the plasmid, could be used to direct the transcription in R. 
5 capsulatus of the genes placed into pNF2 in series with one promoter, or with two 
copies of the nifH promoter. Expression of the genes in other organisms would require 
a procedure analogous to that presented here. 

Alternative Aldehyde Dehydrogenases and Glycerol Dehyd ratases 

Applications of the pathway for the production of 3-hydroxypropionic acid from 

10 glycerol can be made using other suitable aldehyde dehydrogenases. To be functional in 
this pathway an aldehyde dehydrogenase needs to be stable, readily expressed in the 
host of choice and have high enough activity towards 3-hydroxypropionaldehyde to 
enable it to make 3-HP. The knowledge of the present invention makes such 
amendments possible. A program of directed evolution could be undertaken to select 

15 for suitable aldehyde dehydrogenases or they could be recovered from native sources, 
the genes encoding these enzymes in conjunction with a gene encoding an appropriate 
glycerol dehydratase activity, would then be made part of any of the constructs 
envisioned here to produce 3 - hydroxypropionic acid from glycerol. 

A similar program of enzyme improvement including for example directed 

20 evolution could be carried out using the dhaB gene from Klebsiella pneumoniae as a 
starting point to obtain other variants of glycerol dehydratase that are superior in 
efficiency and stability to the form used in this invention. Alternatively, enzymes which 
catalyzes the same reaction may be isolated from others organisms and used in place of 
the Klebsiella pneumoniae glycerol dehydratase. Such enzymes may be especially 

25 useful in alternative hosts wherein they may be more readily expressed, be more stable 
and more efficient under the fermentation conditions best suited to the growth of the 
construct and the production and recovery of 3-HP. 
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SEQUENCE LISTING 

<no> supers, P atrick F 09/ 8' 3 075 T 

Cameron, Douglas C. 

<120> Production of 3-Hydroxypropionic Acid in Re COm bi nan t 

Organisms 

<130> 960296.96617 

<140> 09/830,751 
<141> 2000-08-30 

<150> 60/151,440 
<151> 1999-08-30 

<150> PCT/US00/23878 
<151> 2000-08-30 

<160> 23 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1529 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (25) . . (1509) 

<400> 1 

gtcgcggtac caaggaggta teat atg tea eae ett eet atg aea gtg ect 51 

Met Ser His Leu Pro Met Thr Val Pro 
1 5 

ate aag ctg eee aat ggg ttg gaa tat gag caa eea aeg ggg ttg tte 99 
lie Lys Leu Pro Asn Gly Leu Glu Tyr Glu Gln Pro Th " Gly £ Phe " 



10 

25 



" 20 



ate aac aac aag ttt gtt ect tet aaa eag aac aag ace tte gaa gte 
He Asn Asn Lys Phe Val Pro Ser Lys Gin Asn Lys Thr Phe Glu Val 



147 



30 35 



Til T 5,33 933 t9t Cat att ta t gaa ggt aga 

He Asn Pro Ser Thr Glu Glu Glu lie Cys His He Tyr Glu Gly Arg 



40 

aga 195 



50 55 
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gag gac gat gtg gaa gag gec gtg cag gec gec gac cgt gec ttc tct 
Glu Asp Asp Val Glu Glu Ala Val Gin Ala Ala Asp Arg Ala Phe Ser 
60 65 70 



243 



aat ggg tct tgg aac ggt ate gac cct att gac agg ggt aag get ttg 
Asn Gly Ser Trp Asn Gly lie Asp Pro lie Asp Arg Gly Lys Ala Leu 
75 80 85 



291 



tac agg tta gec gaa tta att gaa cag gac aag gat gtc att get tec 
Tyr Arg Leu Ala Glu Leu He Glu Gin Asp Lys Asp Val He Ala Ser 
90 95 100 105 



339 



ate gag act ttg gat aac ggt aaa get ate tct tec teg aga gga gat 
.He Glu Thr Leu Asp Asn Gly Lys Ala He Ser Ser Ser Arg Gly Asp 
110 115 120 



387 



gtt gat tta gtc ate aac tat ttg aaa tct tct get ggc ttt get gat 
Val Asp Leu Val He Asn Tyr Leu Lys Ser Ser Ala Gly Phe Ala Asp 
125 130 135 



435 



aaa att gat ggt aga atg att gat act ggt aga ace cat ttt tct tac 
Lys He Asp Gly Arg Met He Asp Thr Gly Arg Thr His Phe Ser Tyr 
140 145 150 



483 



act aag aga cag cct ttg ggt gtt tgt ggg cag att att cct tgg aat 
Thr Lys Arg Gin Pro Leu Gly Val Cys Gly Gin He He Pro Trp Asn 
155 160 165 



531 



ttc cca ctg ttg atg tgg gec tgg aag att gee cct get ttg gtc acc 
Phe Pro Leu Leu Met Trp Ala Trp Lys He Ala Pro Ala Leu Val Thr 
170 175 180 185 



579 



ggt aac acc gtc gtg ttg aag act gec gaa tec acc cca ttg tec get 
Gly Asn Thr Val Val Leu Lys Thr Ala Glu Ser Thr Pro Leu Ser Ala 
190 195 200 



627 



ttg tat gtg tct aaa tac ate cca cag gcg ggt att cca cct ggt gtg 
Leu Tyr Val Ser Lys Tyr lie Pro Gin Ala Gly He Pro Pro Gly Val 
205 210 215 



675 



ate aac att gta tec ggg ttt ggt aag att gtg gtt gag gec att aca 
He Asn He Val Ser Gly Phe Gly Lys He Val Val Glu Ala He Thr 
220 225 230 



723 



aac cat cca aaa ate aaa aag gtt gee ttc aca ggg tec acg get acg 
Asn His Pro Lys He Lys Lys Val Ala Phe Thr Gly Ser Thr Ala Thr 
235 240 245 



771 
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ggt aga cac att tac cag tec gca gec gca ggc ttg aaa aaa gtg act 819 

Gly Arg His lie Tyr Gin Ser Ala Ala Ala Gly Leu Lys Lys Val Thr 

250 255 260 265 

\ 

ttg gag ctg ggt ggt aaa tea cca aac att gtc ttc gcg gac gec gag- 867 

Leu Glu Leu Gly Gly Lys Ser Pro Asn He Val Phe Ala Asp Ala Glu 
270 275 280 

ttg aaa aaa gec gtg caa aac att ate ctt ggt ate tac tac aat tct 915 
Leu Lys Lys Ala Val Gin Asn He He Leu Gly He Tyr Tyr Asn Ser 
285 290 295 

ggt gag gtc tgt tgt gcg ggt tea agg gtg tat gtt gaa gaa tct att 963 
Gly Glu Val Cys Cys Ala Gly Ser Arg Val Tyr Val Glu Glu Ser He 
300 305 310 

tac gac aaa ttc att gaa gag ttc aaa gee get tct gaa tec ate aag 1011 
Tyr Asp Lys Phe He Glu Glu Phe Lys Ala Ala Ser Glu Ser He Lys 
315 320 325 

gtg ggc gac cca ttc gat gaa tct act ttc caa ggt gca caa acc tct 1059 
Val Gly Asp Pro Phe Asp Glu Ser Thr Phe Gin Gly Ala Gin Thr Ser 
330 335 340 345 

caa-atg caa eta aac aaa ate ttg aaa tac gtt gac att ggt aag aat 1107 
Gin Met Gin Leu Asn Lys He Leu Lys Tyr Val Asp He Gly Lys Asn 
350 355 360 

gaa ggt get act ttg att acc ggt ggt gaa aga tta ggt age aag ggt 1155 
Glu Gly Ala Thr Leu He Thr Gly Gly Glu Arg Leu Gly Ser Lys Gly 
365 370 375 

tac ttc att aag cca act gtc ttt ggt gac gtt aag gaa gac atg aga 1203 
Tyr Phe He Lys Pro Thr Val Phe Gly Asp Val Lys Glu Asp Met Arg 
380 385 390 

att gtc aaa gag gaa ate ttt ggc cct gtt gtc act gta acc aaa ttc 1251 
He Val Lys Glu Glu He Phe Gly Pro Val Val Thr Val Thr Lys Phe 
395 400 405 

aaa tct gee gac gaa gtc att aac atg gcg aac gat tct gaa tac ggg 1299 
Lys Ser Ala Asp Glu Val He Asn Met Ala Asn Asp Ser Glu Tyr Gly 
410 415 420 425 

ttg get get ggt att cac acc tct aat att aat acc gec tta aaa gtg 1347 
Leu Ala Ala Gly He His Thr Ser Asn He Asn Thr Ala Leu Lys Val 
430 435 440 
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get gat aga gtt aat gcg 
Ala Asp Arg Val Asn Ala 
445 

ttc cac cac gca gtt cct 
Phe His His Ala Val Pro 
460 

agg gaa atg tct gtt gat 
Arg Glu Met Ser Val Asp 
475 

gtc cgt gec aaa ttg gac 
Val Arg Ala Lys Leu Asp 
490 495 



ggt acg gtc tgg ata aac 
Gly Thr Val Trp He Asn 
450 

ttc ggt ggg ttc aat gca 
Phe Gly Gly Phe Asn Ala 
465 

get tta caa aac tac ttg 
Ala Leu Gin Asn Tyr Leu 
480 485 

gagtaagagc tegaattege 



act tat aac gat 1395 
Thr Tyr Asn Asp 
455 

tct ggt ttg ggc 1443 

Ser Gly Leu Gly 

470 

caa gtt aaa gcg 1491 
Gin Val Lys Ala 



1529 



<210> 2 
<211> 495 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Ser His Leu Pro Met Thr Val Pro He Lys Leu Pro Asn Gly Leu 
15 10 15 

Glu Tyr Glu Gin Pro Thr Gly Leu Phe He Asn Asn Lys Phe Val Pro 
20 25 30 

Ser Lys Gin Asn Lys Thr Phe Glu Val He Asn Pro Ser Thr Glu Glu 
35 40 45 

Glu He Cys His He Tyr Glu Gly Arg Glu Asp Asp Val Glu Glu Ala 
50 55 60 

Val Gin Ala Ala Asp Arg Ala Phe Ser Asn Gly Ser Trp Asn Gly* He 
65 70 75 80 

Asp Pro He Asp Arg Gly Lys Ala Leu Tyr Arg Leu Ala Glu Leu He 
85 90 95 

Glu Gin Asp Lys Asp Val He Ala Ser He Glu Thr Leu Asp Asn Gly 
100 105 110 



Lys Ala He Ser Ser Ser Arg Gly Asp Val Asp Leu Val He Asn Tyr 
115 120 125 



Leu Lys Ser Ser Ala 
130 

Asp Thr Gly Arg Thr 
145 

Val Cys Gly Gin He 
165 

Trp Lys He Ala Pro 
180 

Thr Ala Glu Ser Thr 
195 

Pro Gin Ala Gly lie 
210 

Gly Lys He Val Val 
225 

Val Ala Phe Thr Gly 
245 

Ala Ala Ala Gly Leu 
260 

Pro Asn He Val Phe 
275 

He He Leu Gly He 
290 

Ser Arg Val Tyr Val 
305 

Phe Lys Ala Ala Ser 

325 

Ser Thr Phe Gin Gly 
340 

Leu Lys Tyr Val Asp 
355 

Gly Gly Glu Arg Leu 
370 



Gly Phe Ala Asp Lys He 
135 

His Phe Ser Tyr Thr Lys 
150 155 

He Pro Trp Asn Phe Pro 
170 

Ala Leu Val Thr Gly Asn 
185 

Pro Leu Ser Ala Leu Tyr 
200 

Pro Pro Gly Val He Asn 
215 

Glu Ala He Thr Asn His 
230 235 

Ser Thr Ala Thr Gly Arg 
250 

Lys Lys Val Thr Leu Glu 
265 

Ala Asp Ala Glu Leu Lys 
280 

Tyr Tyr Asn Ser Gly Glu 
295 

Glu Glu Ser He Tyr Asp 
310 315 

Glu Ser He Lys Val Gly 
330 

Ala Gin Thr Ser Gin Met 
345 

He Gly Lys Asn Glu Gly 
360 

Gly Ser Lys Gly Tyr Phe 
375 



Asp Gly Arg Met He 
140 

Arg Gin Pro Leu Gly 
160 

Leu Leu Met Trp Ala 
175 

Thr Val Val Leu Lys 
190 

Val Ser Lys Tyr He 
205 

He Val Ser Gly Phe 
220 

Pro Lys He Lys Lys 
240 

His He Tyr Gin Ser 
255 

Leu Gly Gly Lys Ser 
270 

Lys Ala Val Gin Asn 
285 

Val Cys Cys Ala Gly 
300 

Lys Phe lie Glu Glu 
320 

Asp Pro Phe Asp Glu 
335 

Gin Leu Asn Lys He 
350 

Ala Thr Leu He Thr 
365 

He Lys Pro Thr Val 
380 



Phe Gly Asp Val 
385 

Gly Pro Val Val 



Asn Met Ala Asn 
420 

Ser Asn lie Asn 
435 

Thr Val Trp lie 
450 

Gly Gly Phe Asn 
465 

Leu Gin Asn Tyr 



Lys Glu Asp Met 
390 

Thr Val Thr Lys 
405 

Asp Ser Glu Tyr 



Thr Ala Leu Lys 
440 

Asn Thr Tyr Asn 
455 

Ala Ser Gly Leu 
470 

Leu Gin Val Lys 
485 



Arg lie Val Lys 
395 

Phe Lys Ser Ala 
410 

Gly Leu Ala Ala 
425 

Val Ala Asp Arg 



Asp Phe His His 
460 

Gly Arg Glu Met 
475 

Ala Val Arg Ala 
490 



Glu Glu lie Phe 
400 

Asp Glu Val lie 
415 

Gly lie His Thr 
430 

Val Asn Ala Gly 
445 

Ala Val Pro Phe 



Ser Val Asp Ala 

- 480 

Lys Leu Asp 
495 



<210> 3 

<211> 1541 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (22) . . (1521) 

<400> 3 

gcggtaccaa ggagatatca t atg tea gec gec gec acc cag gec gtg cct 

Met Ser Ala Ala Ala Thr Gin Ala Val Pro 
1 5 10 

gec ccc aac cag cag ccc gag gtc ttc tgc aac cag att ttc ata aac 
Ala Pro Asn Gin Gin Pro Glu Val Phe Cys Asn Gin He Phe He Asn 
15 20 25 

aat gaa tgg cac gat gec gtc age agg aaa aca ttc ccc acc gtc aat 
Asn Glu Trp His Asp Ala Val Ser Arg Lys Thr Phe Pro Thr Val Asn 
30 35 40 



ccg tec act gga gag gtc ate tgt cag gta get gaa ggg gac aag gaa 
Pro Ser Thr Gly Glu Val He Cys Gin Val Ala Glu Gly Asp Lys Glu 
45 50 55 
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gat gtg gac aag gca cgt gaa ggc cgc ccg ggc gcc ttc cag ctg ggc 
Asp Val Asp Lys Ala Arg Glu Gly Arg Pro Gly Ala Phe Gin Leu Gly 
60 65 70 



243 



tea cct tgg cgc cgc atg gac gca tea cac age ggc egg ctg ctg aac 

Ser Pro Trp Arg Arg Met Asp Ala Ser His Ser Gly Arg Leu Leu Asn 

75 80 85 90 

cgc ctg gcc gat ctg ate gag egg gac egg ace tac ctg gcg gcc ttg 

Arg Leu Ala Asp Leu lie Glu Arg Asp Arg Thr Tyr Leu Ala Ala Leu 

95 100 105 



291 



339 



gag acc ctg gac aat ggc aag ccc tat gtc ate tec tac ctg gtg gat 
Glu Thr Leu Asp Asn Gly Lys Pro Tyr Val He Ser Tyr Leu Val Asp 
110 H5 120 



387 



ttg gac atg gtc etc aaa tgt etc egg tat tat gcc ggc tgg get gat 
Leu Asp Met Val Leu Lys Cys Leu Arg Tyr Tyr Ala Gly Trp Ala Asp 
125 130 135 



435 



aag tac cac ggg aaa acc ate ccc att gac gga gac ttc ttc age tac 
Lys Tyr His Gly Lys Thr He Pro He Asp Gly Asp Phe Phe Ser Tyr 
140 145 150 

aca cgc cat gaa cct gtg ggg gtg tgc ggg cag ate att ccg tgg aat 
Thr Arg His Glu Pro Val Gly Val Cys Gly Gin He He Pro Trp Asn 
155 160 165 170 

ttc ccg etc ctg atg caa gca tgg aag ctg ggc cca gcc ttg gca act 
Phe Pro Leu Leu Met Gin Ala Trp Lys Leu Gly Pro Ala Leu Ala Thr 
175 180 185 



483 



531 



579 



gga aac gtg gtt gtg atg aag gta get gag cag aca ccc etc acc gcc 
Gly Asn Val Val Val Met Lys Val Ala Glu Gin Thr Pro Leu Thr Ala 
190 195 200 



627 



etc tat gtg gcc aac ctg ate aag gag get ggc ttt ccc cct ggt gtg 
Leu Tyr Val Ala Asn Leu He Lys Glu Ala Gly Phe Pro Pro Gly Val 
205 210 215 



675 



gtc aac att gtg cct gga ttt ggc ccc acg get ggg gcc gcc att gcc 
Val Asn He Val Pro Gly Phe Gly Pro Thr Ala Gly Ala Ala He Ala 
220 225 230 



723 



tec cat gag gat gtg gac aaa gtg gca ttc aca ggc tec act gag att 
Ser His Glu Asp Val Asp Lys Val Ala Phe Thr Gly Ser Thr Glu He 
235 240 245 250 



771 
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ggc cgc gta ate cag gtt get get ggg age age aac etc aag aga gtg 819 
Gly Arg Val lie Gin Val Ala Ala Gly Ser Ser Asn Leu Lys Arg Val 
255 260 265 

ace ttg gag ctg ggg ggg aag age ccc aac ate ate atg tea gat gee 867 
Thr Leu Glu Leu Gly Gly Lys Ser Pro Asn He He Met Ser Asp Ala 
270 275 280 

gat atg gat tgg gee gtg gaa cag gec cac ttc gec ctg ttc ttc aac 915 
Asp Met Asp Trp Ala Val Glu Gin Ala His Phe Ala Leu Phe Phe Asn 
285 290 295 



cag ggc cag tgc tgc tgt gec ggc tec egg acc ttc gtg cag gag gac 
Gin Gly Gin Cys Cys Cys Ala Gly Ser Arg Thr Phe Val Gin Glu Asp 
300 305 310 



ttc aag acc ata gag gag gtt gtt ggg aga gee aac aat tec acg tac 
Phe Lys Thr He Glu Glu Val Val Gly Arg Ala Asn Asn Ser Thr Tyr 
415 420 425 

ggg ctg gec gca get gtc ttc aca aag gat ttg gac aag gec aat tac 
Gly Leu Ala Ala Ala Val Phe Thr Lys Asp Leu Asp Lys Ala Asn Tyr 
430 435 440 



963 



ate tat gat gag ttt gtg gtg egg age gtt gec egg gec aag tct egg 1011 
He Tyr Asp Glu Phe Val Val Arg Ser Val Ala Arg Ala Lys Ser Arg 
315 320 325 330 



1107 



gtg gtc ggg aac ccc ttt gat age aag acc gag cag ggg ccg cag gtg 1059 
Val Val Gly Asn Pro Phe Asp Ser Lys Thr Glu Gin Gly Pro Gin Val 
335 340 345 

gat gaa act cag ttt aag aag ate etc ggc tac ate aac acg ggg aag 
Asp Glu Thr Gin Phe Lys Lys He Leu Gly Tyr He Asn Thr Gly Lys 
350 355 360 

caa gag ggg gcg aag ctg ctg tgt ggt ggg ggc att get get gac cgt 1155 
Gin Glu Gly Ala Lys Leu Leu Cys Gly Gly Gly He Ala Ala Asp Arg 
365 370 375 

ggt tac ttc ate cag ccc act gtg ttt gga gat gtg cag gat ggc atg 1203 
Gly Tyr Phe He Gin Pro Thr Val Phe Gly Asp Val Gin Asp Gly Met 
380 385 390 

acc ate gee aag gag gag ate ttc ggg cca gtg atg cag ate ctg aag 1251 
Thr He Ala Lys Glu Glu He Phe Gly Pro Val Met Gin He Leu Lys 
395 400 405 410 



1299 



1347 
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ctg tec cag gec etc cag gcg ggc act gtg tgg gtc aac tgc tat gat 1395 

Leu Ser Gin Ala Leu Gin Ala Gly Thr Val Trp Val Asn Cys Tyr Asp 

445 450 455 

gtg ttt gga gec cag tea ccc ttt ggt ggc tac aag atg teg ggg agt 1443 

Val Phe Gly Ala Gin Ser Pro Phe Gly Gly Tyr Lys Met Ser Gly Ser 

460 465 470 

ggc egg gag ttg ggc gag tac ggg ctg cag gca tac act gaa gtg aaa 14 91 

Gly Arg Glu Leu Gly Glu Tyr Gly Leu Gin Ala Tyr Thr Glu Val Lys 
475 480 485 490 

act gtc aca gtc aaa gtg cct cag aag aac tcataagagc tegaattege 1541 
Thr Val Thr Val Lys Val Pro Gin Lys Asn 
495 500 



<210> 4 
<211> 500 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ser Ala Ala Ala Thr Gin Ala Val Pro Ala Pro Asn Gin Gin Pro 
15 10 15 



Glu Val Phe Cys Asn Gin He Phe 
20 

Val Ser Arg Lys Thr Phe Pro Thr 
35 40 



He Asn Asn Glu Trp His Asp Ala 
25 30 

Val Asn Pro Ser Thr Gly Glu Val 
45 



He Cys Gin Val Ala Glu Gly Asp Lys Glu Asp Val Asp Lys Ala Arg 
50 55 60 

Glu Gly Arg Pro Gly Ala Phe Gin Leu Gly Ser Pro Trp Arg Arg Met 
65 70 75 80 

Asp Ala Ser His Ser Gly Arg Leu Leu Asn Arg Leu Ala Asp Leu He 
85 90 95 

Glu Arg Asp Arg Thr Tyr Leu Ala Ala Leu Glu Thr Leu Asp Asn Gly 
100 105 110 

Lys Pro Tyr Val He Ser Tyr Leu Val Asp Leu Asp Met Val Leu Lys 
115 120 125 



Cys Leu Arg Tyr Tyr Ala Gly Trp Ala Asp Lys Tyr His Gly Lys Thr 
130 135 140 



lie Pro lie Asp Gly Asp Phe Phe Ser Tyr Thr Arg His Glu Pro Val 
145 150 155 160 

Gly Val Cys Gly Gin lie lie Pro Trp Asn Phe Pro Leu Leu Met Gin 
165 170 175 

Ala Trp Lys Leu Gly Pro Ala Leu Ala Thr Gly Asn Val Val Val Met 
180 185 190 

Lys Val Ala Glu Gin Thr Pro Leu Thr Ala Leu Tyr Val Ala Asn Leu 
195 200 205 

He Lys Glu Ala Gly Phe Pro Pro Gly Val Val Asn He Val Pro Gly 
210 215 220 

Phe Gly Pro Thr Ala Gly Ala Ala He Ala Ser His Glu Asp Val Asp 
225 230 235 240 

Lys Val Ala Phe Thr Gly Ser Thr Glu He Gly Arg Val He Gin Val 
245 250 255 

Ala Ala Gly Ser Ser Asn Leu Lys Arg Val Thr Leu Glu Leu Gly Gly 
260 ♦ 265 270 

Lys Ser Pro Asn He He Met Ser Asp Ala Asp Met Asp Trp Ala Val 
275 280 285 

Glu Gin Ala His Phe Ala Leu Phe Phe Asn Gin Gly Gin Cys Cys Cys 
290 295 300 

Ala Gly Ser Arg Thr Phe Val Gin Glu Asp He Tyr Asp Glu Phe Val 
305 310 315 320 

Val Arg Ser Val Ala Arg Ala Lys Ser Arg Val Val Gly Asn Pro Phe 
325 330 335 

Asp Ser Lys Thr Glu Gin Gly Pro Gin Val Asp Glu Thr Gin Phe Lys 
340 345 350 

Lys He Leu Gly Tyr He Asn Thr Gly Lys Gin Glu Gly Ala Lys Leu 
355 1 360 365 



Leu Cys Gly Gly Gly He Ala Ala Asp Arg Gly Tyr Phe He Gin Pro 
370 375 380 



Thr Val Phe Gly Asp 

385 

lie Phe Gly Pro Val 
405 

Val Val Gly Arg Ala 
420 

Phe Thr Lys Asp Leu 
435 

Ala Gly Thr Val Trp 
450 

Pro Phe Gly Gly Tyr 
465 

Tyr Gly Leu Gin Ala 
485 



Val Gin Asp Gly Met Thr 
390 395 

Met Gin lie Leu Lys Phe 
410 

Asn Asn Ser Thr Tyr Gly 
425 

Asp Lys Ala Asn Tyr Leu 
440 

Val Asn Cys Tyr Asp Val 
455 

Lys Met Ser Gly Ser Gly 
470 475 

Tyr Thr Glu Val Lys Thr 
490 



lie Ala Lys Glu Glu 
400 

Lys Thr lie Glu Glu 
415 

Leu Ala Ala Ala Val 
430 

Ser Gin Ala Leu Gin 
445 

Phe Gly Ala Gin Ser 
460 

Arg Glu Leu Gly Glu 
480 

Val Thr Val Lys Val 
495 



Pro Gin Lys Asn 
500 



<210> 5 
<211> 1513 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (37) . . (1473) 

<400> 5 

gctaccatgg cttaaccggt accaaggaga tatcat atg tea gta ccc gtt caa 

Met Ser Val Pro Val Gin 
1 5 

cat cct atg tat ate gat gga cag ttt gtt acc tgg cgt gga gac gca 
His Pro Met Tyr lie Asp Gly Gin Phe Val Thr Trp Arg Gly Asp Ala 
10 15 20 



tgg att gat gtg gta aac cct get aca gag get gtc att tec cgc ata 
Trp lie Asp Val Val Asn Pro Ala Thr Glu Ala Val lie Ser Arg lie 
25 30 35 



ccc gat ggt cag gcc gag gat gcc cgt aag gca ate gat gca gca gaa 
Pro Asp Gly Gin Ala Glu Asp Ala Arg Lys Ala lie Asp Ala Ala Glu 
40 45 50 



198 



cgt gca caa cca gaa tgg gaa gcg ttg cct get att gaa cgc gcc agt 
Arg Ala Gin Pro Glu Trp Glu Ala Leu Pro Ala lie Glu Arg Ala Ser 
55 60 65 70 



246 



tgg ttg cgc aaa ate tec gcc ggg ate cgc gaa cgc gcc agt gaa ate 
Trp Leu Arg Lys lie Ser Ala Gly lie Arg Glu Arg Ala Ser Glu lie 
75 80 85 



294 



agt gcg ctg att gtt gaa gaa ggg ggc aag ate cag cag ctg get gaa 
Ser Ala Leu lie Val Glu Glu Gly Gly Lys lie Gin Gin Leu Ala Glu 
90 95 100 



342 



gtc gaa gtg get ttt act gcc gac tat ate gat tac atg gcg gag tgg 
Val Glu Val Ala Phe Thr Ala Asp Tyr lie Asp Tyr Met Ala Glu Trp 
105 110 115 



390 



gca egg cgt tac gag ggc gag att att caa age gat cgt cca gga gaa 
Ala Arg Arg Tyr Glu Gly Glu lie He Gin Ser Asp Arg Pro Gly Glu 
120 125 130 



438 



aat att ctt ttg ttt aaa cgt gcg ctt ggt gtg act acc ggc att ctg 
Asn He Leu Leu Phe Lys Arg Ala Leu Gly Val Thr Thr Gly He Leu 
135 140 145 150 



486 



ccg tgg aac ttc ccg ttc ttc etc att gcc cgc aaa atg get ccc get 
Pro Trp Asn Phe Pro Phe Phe Leu He Ala Arg Lys Met Ala Pro Ala 
155 160 165 



534 



ctt ttg acc ggt aat acc ate gtc att aaa cct agt gaa ttt acg aca 
Leu Leu Thr Gly Asn Thr He Val He Lys Pro Ser Glu Phe Thr Thr 
170 175^ 180 



582 



aac aat gcg att gca ttc gcc aaa ate gtc gat gaa ata ggc ctt ccg 
Asn Asn Ala He Ala Phe Ala Lys He Val Asp Glu He Gly Leu Pro 
185 190 195 



630 



cgc ggc gtg ttt aac ctt gta ctg ggg cgt ggt gaa acc gtt ggg caa 
Arg Gly Val Phe Asn Leu Val Leu Gly Arg Gly Glu Thr Val Gly Gin 
200 205 210 



678 



gaa ctg gcg ggt aac cca aag gtc gca atg gtc agt atg aca ggc age 
Glu Leu Ala Gly Asn Pro Lys Val Ala Met Val Ser Met Thr Gly Ser 
215 220 225 230 



726 
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gtc tct gca ggt gag aag ate atg gcg act gcg gcg aaa aac ate acc 774 
Val Ser Ala Gly Glu Lys lie Met Ala Thr Ala Ala Lys Asn He Thr 
235 240 245 

aaa gtg tgt ctg gaa ttg ggg ggt aaa gca cca get ate gta atg gac 822 
Lys Val Cys Leu Glu Leu Gly Gly Lys Ala Pro Ala He Val Met Asp 
250 255 260 

gat gee gat ctt gaa ctg gca gtc aaa gec ate gtt gat tea cgc gtc 
Asp Ala Asp Leu Glu Leu Ala Val Lys Ala He Val Asp Ser Arg Val 
265 270 275 

att aat agt ggg caa^gtg tgt aac tgt gca gaa cgt gtt tat gta cag 
He Asn Ser Gly Gin Val Cys Asn Cys Ala Glu Arg Val Tyr Val Gin 
280 285 290 

aaa ggc att tat gat cag ttc gtc aat egg ctg ggt gaa gcg atg cag 
Lys Gly He Tyr Asp Gin Phe Val Asn Arg Leu Gly Glu Ala Met Gin 
295 300 305 310 

gcg gtt caa ttt ggt aac ccc get gaa cgc aac gac att gcg atg ggg 
Ala Val Gin Phe Gly Asn Pro Ala Glu Arg Asn Asp He Ala Met Gly 
315 320 325 

ccg ttg att aac gec gcg gcg ctg gaa agg gtc gag caa aaa gtg gcg 
Pro Leu He Asn Ala Ala Ala Leu Glu Arg Val Glu Gin Lys Val Ala 
330 335 340 

cgc gca gta gaa gaa ggg gcg aga gtg gcg ttc ggt ggc aaa gcg gta 1110 
Arg Ala Val Glu Glu Gly Ala Arg Val Ala Phe Gly Gly Lys Ala Val 
345 350 355 

gag ggg aaa gga tat tat tat ccg ccg aca ttg ctg ctg gat gtt cgc 1158 
Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr Leu Leu Leu Asp Val Arg 
360 365 370 



cag gaa atg teg att atg cat gag gaa acc ttt ggc ccg gtg ctg cca 
Gin Glu Met Ser He Met His Glu Glu Thr Phe Gly Pro Val Leu Pro 
375 380 385 390 



agt gat tac ggc ctg acc tea tea ate tat acc caa aat ctg aac gtc 
Ser Asp Tyr Gly Leu Thr Ser Ser He Tyr Thr Gin Asn Leu Asn Val 
410 415 420 



870 



918 



966 



1014 



1062 



1206 



gtt gtc gca ttt gac acg ctg gaa gat get ate tea atg get aat gac 1254 
Val Val Ala Phe Asp Thr Leu Glu Asp Ala He Ser Met Ala Asn Asp 
395 400. 405 



1302 



gcg atg aaa gcc att aaa ggg ctg aag ttt ggt gaa act tac ate aac 
Ala Met Lys Ala lie Lys Gly .Leu Lys Phe Gly Glu Thr Tyr lie Asn 
425 430 435 



cgt gaa aac ttc gaa get atg caa ggc ttc cac gcc gga tgg cgt aaa 
Arg Glu Asn Phe Glu Ala Met Gin Gly Phe His Ala Gly Trp Arg Lys 
440 445 450 

tec ggt att ggc ggc gca gat ggt aaa cat ggc ttg cat gga tat ctg 
Ser Gly lie Gly Gly Ala Asp Gly Lys His Gly Leu His Gly Tyr Leu 
455 460 465 470 

cag acc cag gtg gtt tat tta cag tct taagagctcg aattcccgtc 
Gin Thr Gin Val Val Tyr Leu Gin Ser 
475 

gaeggctcta gaetcgageg 



<210> 6 
<211> 479 
<212> PRT 

<213> Escherichia coli 
<400> 6 

Met Ser Val Pro Val Gin His Pro Met Tyr lie Asp Gly Gin Phe Val 
1 5 10 15 

Thr Trp Arg Gly Asp Ala Trp lie Asp Val Val Asn Pro Ala Thr Glu 
20 25 30 

Ala Val lie Ser Arg lie Pro Asp Gly Gin Ala Glu Asp Ala Arg Lys 
35 40 45 

Ala lie Asp Ala Ala Glu Arg Ala Gin Pro Glu Trp Glu Ala Leu Pro 
50 55 60 

Ala lie Glu Arg Ala Ser Trp Leu Arg Lys lie Ser Ala Gly lie Arg 
65 70 75 80 

Glu Arg Ala Ser Glu lie Ser Ala Leu lie Val Glu Glu Gly Gly Lys 
85 90 95 

He Gin Gin Leu Ala Glu Val Glu Val Ala Phe Thr Ala Asp Tyr He 
100 105 HO 



Asp Tyr Met Ala Glu Trp Ala Arg Arg Tyr Glu Gly Glu He He Gin 
115 120 125 
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Ser Asp Arg Pro Gly Glu Asn lie Leu Leu Phe Lys Arg Ala Leu Gly 
130 135 140 

Val Thr Thr Gly lie Leu Pro Trp Asn ' Phe Pro Phe Phe Leu lie Ala 
145 150 155 160 

Arg Lys Met Ala Pro Ala Leu Leu Thr Gly Asn Thr lie Val lie Lys 
165 170 175 

Pro Ser Glu Phe Thr Thr Asn Asn Ala lie Ala Phe Ala Lys He Val 
180 185 190 

Asp Glu He Gly Leu Pro Arg Gly Val Phe Asn Leu Val Leu Gly Arg 
195 200 205 

Gly Glu Thr Val Gly Gin Glu Leu Ala Gly Asn Pro Lys Val Ala Met 
210 215 220 

Val Ser Met Thr Gly Ser Val Ser Ala Gly Glu Lys He Met Ala Thr 
225 230 235 240 

Ala Ala Lys Asn He Thr Lys Val Cys Leu Glu Leu Gly Gly Lys Ala 
245 250 255 

Pro Ala He Val Met Asp Asp Ala Asp Leu Glu Leu Ala Val Lys Ala 
260 265 270 

He Val Asp Ser Arg Val He Asn Ser Gly Gin Val Cys Asn Cys Ala 
275 280 285 

Glu Arg Val Tyr Val Gin Lys Gly He Tyr Asp Gin Phe Val Asn Arg 
290 295 300 

Leu Gly Glu Ala Met Gin Ala Val Gin Phe Gly Asn Pro Ala Glu Arg 
305 310 315 320 

Asn Asp He Ala Met Gly Pro Leu He Asn Ala Ala Ala Leu Glu Arg 
325 330 335 

Val Glu Gin Lys Val Ala Arg Ala Val Glu Glu Gly Ala Arg Val Ala 
340 345 350 

Phe Gly Gly Lys Ala Val Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr 
355 360 365 



Leu Leu Leu Asp Val Arg Gin Glu Met Ser He Met His Glu Glu Thr 
370 375 380 
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Phe Gly Pro Val Leu 
385 

lie Ser Met Ala Asn 
405 

Thr Gin Asn Leu Asn 
420 

Gly Glu Thr Tyr lie 
435 

His Ala Gly Trp Arg 
450 

Gly Leu His Gly Tyr 
465 



Pro Val Val Ala Phe Asp 
390 395 

Asp Ser Asp Tyr Gly Leu 
410 

Val Ala Met Lys Ala lie 
425 

Asn Arg Glu Asn Phe Glu 
440 

Lys Ser Gly lie Gly Gly 
455 

Leu Gin Thr Gin Val Val 
470 475 



Thr Leu Glu Asp Ala 
400 

Thr Ser Ser lie Tyr 
415 

Lys Gly Leu Lys Phe 
430 

Ala Met Gin Gly Phe 
445 

Ala Asp Gly Lys His 
460 

Tyr Leu Gin Ser 



<210> 7 
<211> 1574 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (22) . . (1557) 

<400> 7 

gcggtaccaa ggaggtatca t atg ace aat aat ccc cct tea gca cag att 51 

Met Thr Asn Asn Pro Pro Ser Ala Gin lie 
1 5 10 

aag ccc ggc gag tat ggt ttc ccc etc aag tta aaa gec cgc tat gac 99 
Lys Pro Gly Glu Tyr Gly Phe Pro Leu Lys Leu Lys Ala Arg Tyr Asp 
15 20 25 

aac ttt att ggc ggc gaa tgg gta gec cct gec gac ggc gag tat tac 147 
Asn Phe lie Gly Gly Glu Trp Val Ala Pro Ala Asp Gly Glu Tyr Tyr 
30 35 40 

cag aat ctg acg ccg gtg acc ggg cag ctg ctg tgc gaa gtg gcg tct 195 
Gin Asn Leu Thr Pro Val Thr Gly Gin Leu Leu Cys Glu Val Ala Ser 
45 50 55 



teg ggc aaa cga gac ate gat ctg gcg ctg gat get gcg cac aaa gtg 



243 
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Ser Gly Lys Arg Asp lie Asp Leu Ala Leu Asp Ala Ala His Lys Val 
60 65 70 

aaa gat aaa tgg gcg cac acc teg gtg cag gat cgt gcg gcg att ctg 
Lys Asp Lys Trp Ala His Thr Ser Val Gin Asp Arg Ala Ala He Leu 
75 80 85 90 



291 



ttt aag att gec gat cga atg gaa caa aac etc gag ctg tta gcg aca 
Phe Lys He Ala Asp Arg Met Glu Gin Asn Leu Glu Leu Leu Ala Thr 
95 100 105 



339 



get gaa acc tgg gat aac ggc aaa ccc att cgc gaa acc agt get gcg 
Ala Glu Thr Trp Asp Asn Gly Lys Pro He Arg Glu Thr Ser Ala Ala 
110 115 120 



387 



gat gta ccg ctg gcg att gac cat ttc cgc tat ttc gec teg tgt att 
Asp Val Pro Leu Ala He Asp His Phe Arg Tyr Phe Ala Ser Cys He 
125 130 135 



435 



egg gcg cag gaa ggt ggg ate agt gaa gtt gat age gaa acc gtg gee 

Arg Ala Gin Glu Gly Gly He Ser Glu Val Asp Ser Glu Thr Val Ala 

140 145 150 

tat cat ttc cat gaa ccg tta ggc gtg gtg ggg cag att ate ccg tgg 

Tyr His Phe His Glu Pro Leu Gly Val Val Gly Gin He He Pro Trp 

155 160 165 170 

aac ttc ccg ctg ctg atg gcg age tgg aaa atg get ccc gcg ctg gcg 

Asn Phe Pro Leu Leu Met Ala Ser Trp Lys Met Ala Pro Ala Leu Ala 

175 180 185 



483 



531 



579 



gcg ggc aac tgt gtg gtg ctg aaa ccc gca cgt ctt acc ccg ctt tct 
Ala Gly Asn Cys Val Val Leu Lys Pro Ala Arg Leu Thr Pro Leu Ser 
190 195 200 



627 



gta ctg ctg eta atg gaa att gtc ggt gat tta ctg ccg ccg ggc gtg 
Val Leu Leu Leu Met Glu He Val Gly Asp Leu Leu Pro Pro Gly Val 
205 210 215 

gtg aac gtg gtc aat ggc gca ggt ggg gta att ggc gaa tat ctg gcg 
Val Asn Val Val Asn Gly Ala Gly Gly Val He Gly Glu Tyr Leu Ala 
220 225 230 

acc teg aaa cgc ate gee aaa gtg gcg ttt acc ggc tea acg gaa gtg 
Thr Ser Lys Arg He Ala Lys Val Ala Phe Thr Gly Ser Thr Glu Val 
235 240 245 250 

ggc caa caa att atg caa tac gca acg caa aac att att ccg gtg acg 



675 



723 



771 



819 




Gly Gin Gin lie Met Gin Tyr Ala Thr Gin Asn He He Pro Val Thr 
255 260 265 

ctg gag ttg ggc ggt aag teg cca aat ate gtc ttt get gat gtg atg 
Leu Glu Leu Gly Gly Lys Ser Pro Asn He Val Phe Ala Asp Val Met 
270 275 280 

gat gaa gaa gat gec ttt ttc gat aaa gcg ctg gaa ggc ttt gca ctg 
Asp Glu Glu Asp Ala Phe Phe Asp Lys Ala Leu Glu Gly Phe Ala Leu 
285 290 295 

ttt gec ttt aac cag ggc gaa gtt tgc acc tgt ccg agt cgt get tta 
Phe Ala Phe Asn Gin Gly Glu Val Cys Thr Cys Pro Ser Arg Ala Leu 
300 305 310 

gtg cag gaa tct ate tac gaa cgc ttt atg gaa cgc gec ate cgc cgt 
Val Gin Glu Ser He Tyr Glu Arg Phe Met Glu Arg Ala He Arg Arg 
315 320 325 330 

gtc gaa age att cgt age ggt aac ccg etc gac age gtg acg caa atg 
Val Glu Ser He Arg Ser Gly Asn Pro Leu Asp Ser Val Thr Gin Met 
335 340 345 

ggc gcg cag gtt tct cac ggg caa ctg gaa acc ate etc aac tac att 
Gly Ala Gin Val Ser His. Gly Gin Leu Glu Thr He Leu Asn Tyr He 
350 355 360 

gat ate ggt aaa aaa gag ggc get gac gtg etc aca ggc ggg egg cgc 1155 
Asp He Gly Lys Lys Glu Gly Ala Asp Val Leu Thr Gly Gly Arg Arg 
365 370 375 

aag ctg ctg gaa ggt gaa ctg aaa gac ggc tac tac etc gaa ccg acg 1203 
Lys Leu Leu Glu Gly Glu Leu Lys Asp Gly Tyr Tyr Leu Glu Pro Thr 
380 385 390 

att ctg ttt ggt cag aac aat atg egg gtg ttc cag gag gag att ttt 1251 
He Leu Phe Gly Gin Asn Asn Met Arg Val Phe Gin Glu Glu He Phe 
395 400 405 410 

ggc ccg gtg ctg gcg gtg acc acc ttc aaa acg atg gaa gaa gcg ctg 1299 
Gly Pro Val Leu Ala Val Thr Thr Phe Lys Thr Met Glu Glu Ala Leu 
415 420 425 

gag ctg gcg aac gat acg caa tat ggc ctg ggc gcg ggc gtc tgg age 1347 
Glu Leu Ala Asn Asp Thr Gin Tyr Gly Leu Gly Ala Gly Val Trp Ser 
430 435 440 



867 



915 



963 



1011 



1059 



1107 



eg 



c aac ggt aat ctg gee tat aag atg ggg cgc ggc ata cag get ggg 1395 



Arg Asn Gly Asn Leu Ala Tyr Lys Met Gly Arg Gly lie Gin Ala Gly 
445 450 455 



cgc gtg tgg acc aac tgt tat cac get tac ccg gca cat gcg gcg ttt 
Arg Val Trp Thr Asn Cys Tyr His Ala Tyr Pro Ala His Ala Ala Phe 
460 465 470 

ggt ggc tac aaa caa tea ggt ate ggt cgc gaa acc cac aag atg atg 
Gly Gly Tyr Lys Gin Ser Gly lie Gly Arg Glu Thr His Lys Met Met 
475 480 485 490 

ctg gag cat tac cag caa acc aag tgc ctg ctg gtg age tac teg gat 
Leu Glu His Tyr Gin Gin Thr Lys Cys Leu Leu Val Ser Tyr Ser Asp 
495 500 505 

aaa ccg ttg ggg ctg ttc taagagctcg aattege 
Lys Pro Leu Gly Leu Phe 
510 



<210> 8 
<211> 512 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met Thr Asn Asn Pro Pro Ser Ala Gin lie Lys Pro Gly Glu Tyr Gly 
15 10 15 

Phe Pro Leu Lys Leu Lys Ala Arg Tyr Asp Asn Phe lie Gly Gly Glu 
20 25 30 

Trp Val Ala Pro Ala Asp Gly Glu Tyr Tyr Gin Asn Leu Thr Pro Val 
35 40 45 

Thr Gly Gin Leu Leu Cys Glu Val Ala Ser Ser Gly Lys Arg Asp lie 
50 55 60 

Asp Leu Ala Leu Asp Ala Ala His Lys Val Lys Asp Lys Trp Ala His 
65 70 75 80 

Thr Ser Val Gin Asp Arg Ala Ala lie Leu Phe Lys lie Ala Asp Arg 
85 90 95 

Met Glu Gin Asn Leu Glu Leu Leu Ala Thr Ala Glu Thr Trp Asp Asn 
100 105 110 



Gly Lys Pro lie Arg Glu Thr Ser Ala Ala Asp Val Pro Leu Ala lie 



115 



120 



125 



Asp His Phe Arg 
130 

lie Ser Glu Val 
145 

Leu Gly Val Val 



Ala Ser Trp Lys 
180 

Leu Lys Pro Ala 
195 

lie Val Gly Asp 
210 

Ala Gly Gly Val 
225 

Lys Val Ala Phe 



Tyr Ala Thr Gin 

260 

Ser Pro Asn lie 
275 

Phe Asp Lys Ala 
290 

Glu Val Cys Thr 
305 

Glu Arg Phe Met 



Gly Asn Pro Leu 
340 

Gly Gin Leu Glu 
355 

Gly Ala Asp Val 



Tyr Phe Ala Ser 
135 

Asp Ser Glu Thr 
150 

Gly Gin lie lie 
165 

Met Ala Pro Ala 



Arg Leu Thr Pro 
200 

Leu Leu Pro Pro 
215 

lie Gly Glu Tyr 
230 

Thr Gly Ser Thr 
245 

Asn lie lie Pro 



Val Phe Ala Asp 
280 

Leu Glu Gly Phe 
295 

Cys Pro Ser Arg 
310 

Glu Arg Ala lie 
325 

Asp Ser Val Thr 



Thr lie Leu Asn 
360 

Leu Thr Gly Gly 



Cys lie Arg Ala 
140 

Val Ala Tyr His 
155 

Pro Trp Asn Phe 
170 

Leu Ala Ala Gly 
185 

Leu Ser Val Leu 



Gly Val Val Asn 
220 

Leu Ala Thr Ser 
235 

Glu Val Gly Gin 
250 

Val Thr Leu Glu 

265 

Val Met Asp Glu 



Ala Leu Phe Ala 
300 

Ala Leu Val Gin 
315 

Arg Arg Val Glu 
330 

Gin Met Gly Ala 
345 

Tyr lie Asp lie 



Arg Arg Lys Leu 



Gin Glu Gly Gly 



Phe His Glu Pro 
160 

Pro Leu Leu Met 
175 

Asn Cys Val Val 
190 

Leu Leu Met Glu 
205 

Val Val Asn Gly 



Lys Arg lie Ala 
240 

Gin lie Met Gin 
255 

Leu Gly Gly Lys 
270 

Glu Asp Ala Phe 
285 

Phe Asn "Gin Gly 



Glu Ser lie Tyr 
320 

Ser lie Arg Ser 
335 

Gin Val Ser His 
350 

Gly Lys Lys Glu 
365 

Leu Glu Gly Glu 



370 



375 



380 



Leu Lys Asp Gly 
385 

Asn Met Arg Val 



Thr Thr Phe Lys 
420 

Gin Tyr Gly Leu 
435 

Tyr Lys Met Gly 
450 

Tyr His Ala Tyr 
465 

Gly lie Gly Arg 



Thr Lys Cys Leu 
500 



Tyr Tyr Leu Glu 
390 

Phe Gin Glu Glu 
405 

Thr Met Glu Glu 



Gly Ala Gly Val 
440 

Arg Gly lie Gin 

455 

Pro Ala His Ala 
470 

Glu Thr His Lys 
485 

Leu Val Ser Tyr 



Pro Thr lie Leu 
395 

lie Phe Gly Pro 
410 

Ala Leu Glu Leu 
425 

Trp Ser Arg Asn 



Ala Gly Arg Val 
460 

Ala Phe Gly Gly 
475 

Met Met Leu Glu 
4 90 

Ser Asp Lys Pro 
505 



Phe Gly Gin Asn 
400 

Val Leu Ala Val 
415 

Ala Asn Asp Thr 
430 

Gly Asn Leu Ala 
445 

Trp Thr Asn Cys 



Tyr Lys Gin Ser 
480 

His Tyr Gin Gin 
495 

Leu Gly Leu Phe 
510 



<210> 9 
<211> 5268 
<212> DNA 

<213> Klebsiella pneumoniae 
<220> 

<221> misc_feature 

<222> (300) . . (2153) 

<223> Location complement 

<220> 

<221> misc_feature 
<222> (2166) . . (2591) 
<223> Location complement 

<220> 

<221> misc_feature 
<222> (2594) . . (3034) 
<223> Location complement 



<220> 

<221> misc_f eature 
<222> (2191) . . (4858) 
<223> Location complement 

<400> 9 

agcgctatat gcgttgatgc aatttctatg cgcacccgtt ctcggagcac tgtccgaccg 60 
ctttggccgc cgcccagtcc tgctcgcttc gctacttgga gccactatcg actacgcgat 120 
catggcgacc acacccgtcc tgtggatctc ccactgacca aagctggccc cggcgacccg 180 
cagcgcagcg acgcagcccg cgccgaagaa aatgagcaat ccggtgccaa gaaactcggc 240 
cacgcactgc ccggttaagg tagaagtctg gttcattatc ggcatcctga aatagcacgt 300 
taaagagaga ggctggcgcg agcgcccgtt taattcgcct gaccggccag tagcagcccg 360 
gtggcgaccg cattgcgcgg cccttctgtt ccccgaatat tgccctgccc ggcgaccacg 420 
ccatagtgcg acaaggcttc cgtgataagc tgcgggatct caaagtccag cgatgagccg 4 80 
cccaccagca ccacaaaggc gatatcgcga atggaaccgc cgggtgagac ctggcgcagc 54 0 
gcgcgcaggc agttggtgac aaacactttc tctttcgcct gccggcgcac gagacgaatt 600 
ttttccagcg ggctggcgtt atcgatcggc accagttcgc cctccttgat gtacaccact 660 
ttggcgaaca ccgccgggct gagggcttcc cgaaagaact ccaccgcgcc attctcgtga 720 
cgaatactga acaggctttc cactttggcc agcgggtatt tttttatcgc ttccgccagc 780 
gaaagatcct cgaggcccag ctcggtttta atcaacaggc tgaccatatt ccccgccccg 840 
gcgagatgga ccgccgttat ctgcccctcc gcgttgacga tcgccgcatc cgtcgagccg 900 
gcgccgaggt cgaggatcgc cagcggcgcc gcacagccgg gagtggttaa cgccccggcg 960 
atggccatgt tggcctccac gccgcccacc accacctcgg tctgcagtcg ggcgctcagt 1020 
tcgcgggcga taacctgcat ttgcagacga tccgctttca ccatcgccgc catcccgacg 1080 
gcattctcca tggcgcactc gccggccatc ccgccctgca ccttgcgcgg aataaacgta 1140 
tccaccgcca gcagatcctg gatgtatatc gcgctcatct catggccggt cagggacgcc 1200 
attaccttgc gcacccgctc aagcatgccg ccggcgtggg tgcccggttc gccgcggatg 1260 
tcgcgtaccg gagcgcaggc gctcatcgcc tgcatgatgg cttccgcgcc ctcggcgaca 1320 
tcggcctctc cgcggcgctt ttcgccgcta atgtagaggt tgcccgccgg gatcacccgc 1380 
gactgcacat cccc'ctgcgg ggtcttgagc accaccgcgg aacggttgcc aatcagggcg 1440 
cgggcgatgg ggacgatggc ctgggtctct tccgggctta gcccgaagaa ggtggcgatc 1500 
ccgtagggat tcgacaggat ccgcaccacc tggcccggcg cggccacttc caccgccgcc 1560 
attaccccct cggggacctg ctccagcagc gtcacttcat ccaccaccgg cagggtttta 1620 
cgcaggcggt tgttcaccag cacgccgtcg tcctttttga ggatcgccgc caccacgttg 1680 
atcccccggt cgagcgcctc attgagccac cacacggcgt caaggaaatc gacggcgtcg 1740 
tcaatcagta cgatccaccc ctcggcatac tgcgccgccg gcagcgtcgc cagccgcccg 1800 
agggcgatag tcgtccccac gccaacgccc accccgcccg gcgtctgcgg gttatgaccg 1860 
atcatggtcg attcggtgat aatggtctcg gtgatggtct ccatcgccac atcgccaatc 1920 
accggcgcgg cttcgttaag atagatgcga gagacatcgc tcatcgacca cggtgttttc 1980 
gccagggcct gctccagcgc ggcgagggtc ccggcgatat tgtcccgcgt ccctttcatg 2040 
cccgtcgtcg cgacgatccc gctggcaaca aacgccctcg cctgcgggta gtcggacgcc 2100 
agcgccacct cggtggtggc gttgccgata tcaatcccgg ctattaacgg catgctgacc 2160 
tccgcttagc ttcctttacg cagcttatgc cgctgctgat acacttccgc cgactcccgg 2220 
acaaaggcgg cattcactgt cgcatgccag gtgtgctcca gctcgtcggc gatcgccagc 2280 
agctccgcct gcgaggagcg gaacgggcgc agcgcgttat agatagccag aatgcgctcg 2340 
tcaggaatgg cgataagctc cgccgcgcgg cggaaattgc gcgccaccgc atggcgctgc 2400 
atctgctcgg caatctgcgc ctggtactca agggtctggc gggagatccg cacatcctgc 2460 
gggcccacct cgccagagag caccttctcg agggtaatat cggtcaatgg tttgccggta 2520 



o gie Bier:?'': 1 ! a ^cra 



ggcgtcagga tatgetcegg geagegggtg 
ttctcgctca tggtcactcc cttactaagt 
gcaccacatg tttggtctct ttgatatgaa 
gcaccatctg atcgttcacc accggcaccg 
cagcgttttt gccaatctgc eggtaggtet 
gctccaggtt getgagegge ageagatege 
ggataccgat gccgatcccc gagccgctca 
cgtcggacgt gcgcagaatg cgcaccaccc 
caatcagctc tttgaggatc gcgccatggg 
tatcgaaggc agggecgacg ccgatcacca 
ccccgccctc gcgggttttc agggtaaaag 
gaataccgcc ttattcaatg gtgtcgggct 
cccagcgttc ggcagagatg egatagcegg 
tgaccgcact caccacctcg aactgecgat 
cggtgacccg ctggcgcagc atattgagaa 
ggctcagcgc gecgacaata tegaggcegg 
cactcagatc ctccaccacg ttacgcggcg 
cggcctccac ctcctcgtcg gcgattggcg 
tcgcccgcgc cgctttctgg cgaatggcaa 
cgccgtcaac catcaggtca cgctgcagga 
agttcgagcc ggcgaacatg ttgtcgtagt 
agtcggtgcc cggcagcatc tgcatcaggg 
agaaagtctg gtcgttggcg gacgccactt 
ccgccagcac cgcccgaatg cccgacggca 
cgccgttttg cagtccctga accccggcgc 
ggtagagcat cgacttgctc tccgaatagc 
tgtagegcat tttcaacccg egggaggegt 
gegtatcate geegteggta aataccgett 
agctggttaa gccacgcatg cccagctcca 
tcaacacgcc ggggcggccg cactgcgaac 
cgtagcgcgc gataccgacc gtggtctcct 
cggcgtcagc ggcaatctgc aceggattat 
agggggtccg gcgggcacgc atettctgea 
gcgccatcac ctcgaccgct ttggccggcg 
cccggctgac gtgaatatcc accagcatac 
cctgctctgt gcgctcaacg ttgatcgcgt 
actggtcccg gcgtttgccg tccagttcga 
agacegggtc aaaggggctg tecatggega 
gcccgtcctg attgaegggg cgctgggcca 
ttcategget caaaaggtga aatcegcaga 
gccgcccggc cattaccggc aatagcggaa 
atttaatttc gtttcaattt ggcacacgaa 
ctccggcggc aaaaataaaa aatgtgatcg 
acgcccttaa ttacgttttt ccgacgctat 
gacaaaataa cgctgacaaa aataaagcaa 
cgccccctac tgactattcg cgccagcgtt 



getaaeggat aatcctgcac gcgcatggtt 2580 
cgatgtgcag ggtgacgggc tcggcgtcct 2 64 0 
atagegegge tttggccata aattteggee 2700 
gcgaaggtga ctctttgcgc geatagegeg 2760 
ccagcgtcag cagcggcgcc tgggagaaca 2820 
gctgatggat gaccgtggtc cccttcgact 2880 
ggttggccgc atcccaggcc ataaaggaga 2 94 0 
gggcgtgaag cccctcttct tccaccccgg 3000 
gcatatcgat cagagtgtga tgctggtgtt 3060 
cttcatcggc gcgttcatcg gcagaagcta 3120 
agggctgaat ttgggttgtc tgttgcacag 3180 
gaaccacgcc eggaatattt ttgatctccg 3240 
tgcccggccc ctgatagtca ttgatgtcgt 3300 
cgagaatggc egaggtctge aggtaatege 3360 
tattgetgge gatatcctca aagccgctgc 3420 
tgatgttgcg cttcatcatc tcttccaccg 3480 
geatctegtt gctgccgtgc gcgtaggtgg 3540 
gcagccccag etcgeggaaa accgcctgga 3600 
tggtttccgc ctcggtcacc ggaegcagge 3660 
tgttgtaatc atcaaaatct tccgcatcga 3720 
tcggcaccgc gctgtagccg gagaaaataa 3780 
tgcgcgcggt geggegaata tccgagtggg 3840 
cgaggtcgag catagaggcg atcaggtttt 3900 
cagcgccggt catgecgata cagctcaccg 3960 
ctttagtaat gaagatgcag cgegattega 4020 
ccatcagcgc ttcggatccg gtgeeggagg 4080 
aggecgagge gaggaacgee tttgaccacg 4140 
cggtgccgta gaccgacacc gtctcggcgt 4200 
gctcggtggc ctcttccacc gagcactgeg 4260 
cgaccaacag cgccagggcg ttaaacggcg 4320 
gttctgagaa gccgcggatc ccggcctcgg 4380 
ctttgagatt ggtgacgtgg cactggttgg 4440 
gcgccatcat catctccacc aegttcatet 4500 
tgatggcggt agtgatggca atgatctcct 4560 
gggctatttc caccgcctcc aggegcattg 4620 
aateggegat aaateggteg atcatgtcaa 4680 
cgatcagacc gttgtccact tttactgaag 4740 
tcagcccctc ttcaggccac tcgccaatca 4800 
gtactgcaaa tegttttgat cttttcattg 4860 
eggtagegaa tacgccgggc cagegtegtt 4920 
ctttaaatga gccagtggtg aaaaaaataa 4980 
atctaccgac agtttcacta tgaaacttta 5040 
cccgcaatga tataaatcaa ttaataaaaa 5100 
tttaacccta ttgactaaat catggeggge 5160 
gccaaccgaa tggtaatagt tttttactat 5220 
atcctggtgc gggagaga 5268 



<210> 10 
<211> 607 
<212> PRT 

<213> Klebsiella pneumoniae 
<400> 10 

Met Pro Leu lie Ala Gly lie Asp lie Gly Asn Ala Thr Thr Glu Val 
15 10 15 

Ala Leu Ala Ser Asp Tyr Pro Gin Ala Arg Ala Phe Val Ala Ser Gly 
20 25 30 

lie Val Ala Thr Thr Gly Met Lys Gly Thr Arg Asp Asn lie Ala Gly 
35 40 45 

Thr Leu Ala Ala Leu Glu Gin Ala Leu Ala Lys Thr Pro Trp Ser Met 
50 55 60 

Ser Asp Val Ser Arg lie Tyr Leu Asn Glu Ala Ala Pro Val lie Gly 
65 70 75 80 

Asp Val Ala Met Glu Thr lie Thr Glu Thr lie lie Thr Glu Ser Thr 
85 90 95 

Met lie Gly His Asn Pro Gin Thr Pro Gly Gly Val Gly Val Gly Val 
100 105 110 

Gly Thr Thr lie Ala Leu Gly Arg Leu Ala Thr Leu Pro Ala Ala Gin 
115 120 125 

Tyr Ala Glu Gly Trp lie Val Leu He Asp Asp Ala Val Asp Phe Leu 
130 135 140 

Asp Ala Val Trp Trp Leu Asn Glu Ala Leu Asp Arg Gly He Asn Val 
145 150 155 160 

Val Ala Ala He Leu Lys Lys Asp Asp Gly Val Leu Val Asn Asn Arg 
165 170 175 

Leu Arg Lys Thr Leu Pro Val Val Asp Glu Val Thr Leu Leu Glu Gin 
180 185 190 

Val Pro Glu Gly Val Met Ala Ala Val Glu Val Ala Ala Pro Gly Gin 
195 200 205 



Val Val Arg He Leu Ser Asn Pro Tyr Gly He Ala Thr Phe Phe Gly 
210 215 220 



O i{ 3B 3;.o ;z.s;jL 



Leu Ser Pro Glu Glu Thr Gin Ala lie Val Pro lie Ala Arg Ala Leu 
225 230 235 240 

He Gly Asn Arg Ser Ala Val Val Leu Lys Thr Pro Gin Gly Asp Val 
245 250 255 

Gin Ser Arg Val He Pro Ala Gly Asn Leu Tyr He Ser Gly Glu Lys 
260 265 270 

Arg Arg Gly Glu Ala Asp Val Ala Glu Gly Ala Glu Ala He Met Gin 
275 280 285 

Ala Met Ser Ala Cys Ala Pro Val Arg Asp He Arg Gly Glu Pro Gly 
290 295 300 

Thr His Ala Gly Gly Met Leu Glu Arg Val Arg Lys Val Met Ala Ser 
305 310 315 320 

Leu Thr Gly His Glu Met Ser Ala He Tyr He Gin Asp Leu Leu Ala 
325 330 335 

Val Asp Thr Phe He Pro Arg Lys Val Gin Gly Gly Met Ala Gly Glu 
340 345 350 

Cys Ala Met Glu Asn Ala Val Gly Met Ala Ala Met Val Lys Ala Asp 
355 360 365 

Arg Leu Gin Met Gin Val He Ala Arg Glu Leu Ser Ala Arg Leu Gin 
370 375 380 

Thr Glu Val Val Val Gly Gly Val Glu Ala Asn Met Ala He Ala Gly 
385 390 395 400 

Ala Leu Thr Thr Pro Gly Cys Ala Ala Pro Leu Ala He Leu Asp Leu 
405 410 415 

Gly Ala Gly Ser Thr Asp Ala Ala He Val Asn Ala Glu Gly Gin He 
420 425 430 

Thr Ala Val His Leu Ala Gly Ala Gly Asn Met Val Ser Leu Leu He 
435 440 445 



Lys Thr Glu Leu Gly Leu Glu Asp Leu Ser Leu Ala Glu Ala He Lys 
450 455 460 



Lys Tyr Pro Leu Ala Lys Val Glu Ser Leu Phe Ser He Arg His Glu 
465 470 475 480 



Asn Gly Ala Val 



Ala Lys Val Val 
500 

Ala Ser Pro Leu 
515 

Lys Val Phe Val 
530 

Gly Gly Ser lie 
545 

Ser Leu Asp Phe 



Tyr Gly Val Val 
580 

Arg Asn Ala Val 
595 



Glu Phe Phe Arg 
485 

Tyr He Lys Glu 



Glu Lys He Arg 
520 

Thr Asn Cys Leu 
535 

Arg Asp He Ala 

550 

Glu He Pro Gin 
565 

Ala Gly Gin Gly 



Ala Thr Gly Leu 
600 



Glu Ala Leu Ser 
490 

Gly Glu Leu Val 
505 

Leu Val Arg Arg 



Arg Ala Leu Arg 
540 

Phe Val Val Leu 
555 

Leu He Thr Glu 

570 

Asn He Arg Gly 
585 

Leu Leu Ala Gly 



Pro Ala Val Phe 
495 

Pro He Asp Asn 
510 

Gin Ala Lys Glu 
525 

Gin Val Ser Pro 



Val Gly Gly Ser 
560 

Ala Leu Ser His 
575 

Thr Glu Gly Pro 
590 

Gin Ala Asn 
605 



<210> 11 
<211> 141 
<212> PRT 
<213> Klebsiell 

<400> 11 
Met Ser Glu Lys 
1 

Cys Pro Glu His 
20 

Thr Leu Glu Lys 
35 



pneumoniae 



Thr Met Arg Val 

5 

He Leu Thr Pro 



Val Leu Ser Gly 
40 



Gin Asp Tyr Pro 
10 

Thr Gly Lys Pro 
25 

Glu Val Gly Pro 



Leu Ala Thr Arg 
15 

Leu Thr Asp He 
30 

Gin Asp Val Arg 
45 



He Ser Arg Gin Thr Leu Glu Tyr Gin Ala Gin He Ala Glu Gin Met 

50 55 60 - 

Gin Arg His Ala Val Ala Arg Asn Phe Arg Arg Ala Ala Glu Leu He 
65 70 75 80 



Ala He Pro Asp Glu Arg He Leu Ala He Tyr Asn Ala Leu Arg Pro 
85 90 95 



Phe Arg Ser Ser Gin Ala 
100 

His Thr Trp His Ala Thr 
115 

Glu Val Tyr Gin Gin Arg 
130 



Glu Leu Leu Ala lie 
105 

Val Asn Ala Ala Phe 
120 

His Lys Leu Arg Lys 
135 



Ala Asp Glu Leu Glu 
110 

Val Arg Glu Ser Ala 
125 

Gly Ser 
140 



<210> 12 
<211> 146 
<212> PRT 
<213> Klebsiella 



pneumoniae 



<400> 12 

Met Pro His Gly 

1 

Glu Gly Leu His 
20 

Ser Phe Met Ala 
35 

He Gly He Gin 
50 

Leu Pro Leu Ser 
65 

Leu Glu Thr Tyr 



Lys Glu Ser Pro 
100 

Pro Lys Phe Met 
115 

His Val Val Gin 
130 



Ala He Leu Lys 
5 

Ala Arg Val Val 



Trp Asp Ala Ala 
40 

Ser Lys Gly Thr 
55 

Asn Leu Glu Leu 
70 

Arg Gin He Gly 

85 

Ser Pro Val Pro 



Ala Lys Ala Ala 
120 

Asp Ala Glu Pro 
135 



Glu Leu He Ala 
10 

Arg He Leu Arg 
25 

Asn Leu Ser Gly 



Thr Val lie His 
60 

Phe Ser Gin Ala 
75 

Lys Asn Ala Ala 
90 

Val Val Asn Asp 
105 

Leu Phe His He 



Val Thr Leu His 
140 



Gly Val Glu Glu 
15 

Thr Ser Asp Val 
30 

Ser Gly He Gly 
45 

Gin Arg Asp Leu 



Pro Leu Leu Thr 
80 

Arg Tyr Ala Arg 
95 

Gin Met Val Arg 
110 

Lys Glu Thr Lys 
125 

lie Asp Leu Val 



Arg Glu 
145 



<210> 13 
<211> 555 
<212> PRT 

<213> Klebsiella pneumoniae 



<400> 13 
Met Lys Arg Ser 
1 

Gin Asp Gly Leu 
20 

Asp Ser Pro Phe 
35 

He Val Glu Leu 
50 

Arg Phe He Ala 
65 

Met Arg Leu Glu 



Val Ser Arg Glu 
100 

Lys Ala Val Glu 
115 

Ala Leu Gin Lys 
130 

Val Thr Asn Leu 
145 

Glu Ala Gly lie 



Ala Arg Tyr Ala 
180 

Cys Gly Arg Pro 
195 



Lys Arg Phe Ala 
5 

He Gly Glu Trp 



Asp Pro Val Ser 
40 

Asp Gly Lys Arg 
55 

Asp Tyr Ala He 
70 

Ala Val Glu He 

85 

Glu He He Ala 



Val Met Ala Gin 
120 

Met Arg Ala Arg 
135 

Lys Asp Asn Pro 
150 

Arg Gly Phe Ser 
165 

Pro Phe Asn Ala 



Gly Val Leu Thr 
200 



Val Leu Ala Gin 
10 

Pro Glu Glu Gly 
25 

Ser Val Lys Val 



Arg Asp Gin Phe 
60 

Asn Val Glu Arg 
75 

Ala Arg Met Leu 
90 

He Thr Thr Ala 
105 

Met Asn Val Val 



Arg Thr Pro Ser 
140 

Val Gin He Ala 
155 

Glu Gin Glu Thr 
170 

Leu Ala Leu Leu 
185 

Gin Cys Ser Val 



Arg Pro Val Asn 
15 

Leu He Ala Met 
30 

Asp Asn Gly Leu 
45 

Asp Met He Asp 



Thr Glu Gin Ala 
80 

Val Asp lie His 
95 

He Thr Pro Ala 
110 

Glu Met Met Met 
125 

Asn Gin Cys His 



Ala Asp Ala Ala 
160 

Thr Val Gly He 
175 

Val Gly Ser Gin 
190 

Glu Glu Ala Thr 
205 



Glu Leu Glu Leu Gly Met Arg Gly Leu Thr Ser Tyr Ala Glu Thr Val 
210 215 220 



CP3I! 3 0"7'S :1 



Ser Val Tyr Gly Thr Glu Ala Val Phe Thr Asp Gly Asp Asp Thr Pro 
225 230 235 240 

Trp Ser Lys Ala Phe Leu Ala Ser Ala Tyr Ala Ser Arg Gly Leu Lys 
245 250 255 

Met Arg Tyr Thr Ser Gly Thr Gly Ser Glu Ala Leu Met Gly Tyr Ser 
260 - 265 270 

Glu Ser Lys Ser Met. Leu Tyr Leu Glu Ser Arg Cys lie Phe lie Thr 
275 280 285 

Lys Gly Ala Gly Val Gin Gly Leu Gin Asn Gly Ala Val Ser Cys lie 
290 295 300 

Gly Met Thr Gly Ala Val Pro Ser Gly He Arg Ala Val Leu Ala Glu 
305 310 315 320 

Asn Leu He Ala Ser Met Leu Asp Leu Glu Val Ala Ser Ala Asn Asp 
325 330 335 

Gin Thr Phe Ser His Ser Asp He Arg Arg Thr Ala Arg Thr Leu Met 
340 345 350 

Gin Met Leu Pro Gly Thr Asp Phe He Phe Ser Gly Tyr Ser Ala Val 
355 360 365 

Pro Asn Tyr Asp Asn Met Phe Ala Gly Ser Asn Phe Asp Ala Glu Asp 
370 375 380 

Phe Asp Asp Tyr Asn He Leu Gin Arg Asp Leu Met Val Asp Gly Gly 
385 390 395 400 

Leu Arg Pro Val Thr Glu Ala Glu Thr lie Ala lie Arg Gin Lys Ala 
405 410 415 

Ala Arg Ala He Gin Ala Val Phe Arg Glu Leu Gly Leu Pro Pro He 
420 425 430 

Ala Asp Glu Glu Val Glu Ala Ala Thr Tyr Ala His Gly Ser Asn Glu 
435 440 445 

Met Pro Pro Arg Asn Val Val Glu Asp Leu Ser Ala Val Glu Glu Met 
450 455 460 



Met Lys Arg Asn He Thr Gly Leu Asp He Val Gly Ala Leu Ser Arg 
465 470 475 480 



Ser Gly Phe Glu Asp He Ala Ser Asn He Leu Asn Met Leu Arg Gin 
485 490 495 



Arg Val Thr Gly Asp Tyr Leu Gin Thr Ser Ala He Leu Asp Arg Gin 
500 505 510 

Phe Glu Val Val Ser Ala Val Asn Asp He Asn Asp Tyr Gin Gly Pro 
515 520 525 

Gly Thr Gly Tyr Arg He Ser Ala Glu Arg Trp Ala Glu He Lys Asn 
530 535 540 

He Pro Gly Val Val Gin Pro Asp Thr He Glu 
545 550 555 



<210> 14 
<211> 56 
<212> DNA 

<213> Escherichia coli 
<400> 14 

gctaccatgg cttaaccggt accaaggaga tatcatatgt cagtacccgt tcaaca 

<210> 15 
<211> 59 
<212> DNA 

<213> Escherichia coli 
<400> 15 

gcctcgagtc tagagccgtc gacgggaatt cgagctctta agactgtaaa taaaccacc 

<210> 16 
<211> 46 
<212> DNA 

<213> Saccharomyces cerevasiae 
<400> 16 

gcggtaccaa ggaggtatca tatgttcagt agatctacgc tctgct 

<210> 17 
<211> 34 
<212> DNA 

<213> Saccharomyces cerevasiae 



<400> 17 
gcgaattcga 



gctcttactc gtccaatttg gcac 



34 



<210> 18 

<211> 35 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gcggtaccaa ggaggtatca tatgtcagcc gccgc 35 

<210> 19 
<211> 39 
<212> DNA 

<213> Homo sapiens 



<210> 20 
<211> 44 
<212> DNA 

<213> Escherichia coli 
<400> 20 

gcggtaccaa ggaggtatca tatgaccaat aatccccctt cage 44 

<210> 21 
<211> 38 
<212> DNA 

<213> Escherichia coli 



<400> 19 



gcgaattcga 



gctcttatga gttcttctga ggcactttg 



39 



<400> 21 



gcgaattcga gctcttagaa cagccccaac ggtttatc 



38 



<210> 22 



<211> 20 



<212> DNA 



<213> Escherichia coli 



<400> 22 

atcccgccgt taaccaccat 



20 



<210> 23 
<211> 34 
<212> DNA 

<213> Escherichia coli 



<400> 23 

gcggtaccat tgttatccgc tcacaattcc acac 



34 
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SEQUENCE LISTING 

<110> Suthers, Patrick F 
Cameron, Douglas C. 

<12 0> Production of 3-Hydroxypropionic Acid in Recombinant Organisms 

5 <130> UW960296. 96617 

<140> 
<141> 

<160> 23 

<170> Patentln Ver. 2.1 

10 <210> 1 

<211> 1529 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 
15 <221> CDS 

<222> (25) . . (1509) 

<400> 1 

gtcgcggtac caaggaggta teat atg tea cac ctt cct atg aca gtg cct 51 

Met Ser His Leu Pro Met Thr Val Pro 
20 i 5 

ate aag ctg ccc aat ggg ttg gaa tat gag caa cca acg ggg ttg ttc 99 
lie Lys Leu Pro Asn Gly Leu Glu Tyr Glu Gin Pro Thr Gly Leu Phe 
10 15 20 25 

ate aac aac aag ttt gtt cct tct aaa cag aac aag acc ttc gaa gtc 14 7 
25 lie Asn Asn Lys Phe Val Pro Ser Lys Gin Asn Lys Thr Phe Glu Val 

30 35 40 
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att aac cct tec acg gaa gaa gaa ata tgt cat att tat gaa ggt aga 195 
lie Asn Pro Ser Thr Glu Glu Glu lie Cys His lie Tyr Glu Gly Arg 
45 50 55 

gag gac gat gtg gaa gag gec gtg cag gec gec gac cgt gec ttc tct 243 
5 Glu Asp Asp Val Glu Glu Ala Val Gin Ala Ala Asp Arg Ala Phe Ser 
60 65 70 

aat ggg tct tgg aac ggt ate gac cct att gac agg ggt aag get ttg 291 
Asn Gly Ser Trp Asn Gly lie Asp Pro lie Asp Arg Gly Lys Ala Leu 
75 80 85 

10 tac agg tta gec gaa tta att gaa cag gac aag gat gtc att get tec 339 
Tyr Arg Leu Ala Glu Leu lie Glu Gin Asp Lys Asp Val lie Ala Ser 
90 95 100 105 

ate gag act ttg gat aac ggt aaa get ate tct tec teg aga gga gat 387 
He Glu Thr Leu Asp Asn Gly Lys Ala He Ser Ser Ser Arg Gly Asp 
15 110 115 120 

gtt gat tta gtc ate aac tat ttg aaa tct tct get ggc ttt get gat 435 
Val Asp Leu Val He Asn Tyr Leu Lys Ser Ser Ala Gly Phe Ala Asp 
125 130 135 

aaa att gat ggt aga atg att gat act ggt aga ace cat ttt tct tac 483 
20 Lys He Asp Gly Arg Met He Asp Thr Gly Arg Thr His Phe Ser Tyr 
140 145 150 

act aag aga cag cct ttg ggt gtt tgt ggg cag att att cct tgg aat 531 
Thr Lys Arg Gin Pro Leu Gly Val Cys Gly Gin He He Pro Trp Asn 
155 160 165 

25 ttc cca ctg ttg atg tgg gee tgg aag att gec cct get ttg gtc ace 579 
Phe Pro Leu Leu Met Trp Ala Trp Lys He Ala Pro Ala Leu Val Thr 
170 175 180 185 
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ggt aac acc gtc gtg ttg aag act gcc gaa tec acc cca ttg tec get 62 7 
Gly Asn Thr Val Val Leu Lys Thr Ala Glu Ser Thr Pro Leu Ser Ala 
190 195 200 

ttg tat gtg tct aaa tac ate cca cag gcg ggt att cca cct ggt gtg 675 
5 Leu Tyr Val Ser Lys Tyr lie Pro Gin Ala Gly lie Pro Pro Gly Val 
205 210 215 

ate aac att gta tec ggg ttt ggt aag att gtg gtt gag gcc att aca 723 
lie Asn lie Val Ser Gly Phe Gly Lys lie Val Val Glu Ala lie Thr 
220 225 230 

10 aac cat cca aaa ate aaa aag gtt gcc ttc aca ggg tec acg get acg 771 
Asn His Pro Lys lie Lys Lys Val Ala Phe Thr Gly Ser Thr Ala Thr 
235 240 245 

ggt aga cac att tac cag tec gca gcc gca ggc ttg aaa aaa gtg act 819 
Gly Arg His lie Tyr Gin Ser Ala Ala Ala Gly Leu Lys Lys Val Thr 
15 250 255 260 265 

ttg gag ctg ggt ggt aaa tea cca aac att gtc ttc gcg gac gcc gag 867 
Leu Glu Leu Gly Gly Lys Ser Pro Asn lie Val Phe Ala Asp Ala Glu 
270 275 280 

ttg aaa aaa gcc gtg caa aac att ate ctt ggt ate tac tac aat tct 915 
20 Leu Lys Lys Ala Val Gin Asn lie lie Leu Gly lie Tyr Tyr Asn Ser 
285 290 295 

ggt gag gtc tgt tgt gcg ggt tea agg gtg tat gtt gaa gaa tct att 963 
Gly Glu Val Cys Cys Ala Gly Ser Arg Val Tyr Val Glu Glu Ser lie 
300 305 310 

25 tac gac aaa ttc att gaa gag ttc aaa gcc get tct gaa tec ate aag 1011 
Tyr Asp Lys Phe lie Glu Glu Phe Lys Ala Ala Ser Glu Ser He Lys 
315 320 325 
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gtg ggc gac 
Val Gly Asp 
330 

caa atg caa 
5 Gin Met Gin 

gaa ggt get 
Glu Gly Ala 

10 tac ttc att 
Tyr Phe lie 
380 

att gtc aaa 
lie Val Lys 
15 395 

aaa tct gec 

Lys Ser Ala 
410 

ttg get get 

20 Leu Ala Ala 

get gat aga 
Ala Asp Arg 

25 ttc cac cac 
Phe His His 
460 



cca ttc gat 
Pro Phe Asp 
335 

eta aac aaa 
Leu Asn Lys 
350 

act ttg att 
Thr Leu lie 
365 

aag cca act 
Lys Pro Thr 

gag gaa ate 
Glu Glu lie 



gac gaa gtc 
Asp Glu Val 
415 

ggt att cac 
Gly lie His 
430 

gtt aat gcg 
Val Asn Ala 
445 

gca gtt cct 
Ala Val Pro 



gaa tct act ttc 
Glu Ser Thr Phe 

ate ttg aaa tac 
lie Leu Lys Tyr 
355 

acc ggt ggt gaa 
Thr Gly Gly Glu 
370 

gtc ttt ggt gac 
Val Phe Gly Asp 
385 

ttt ggc cct gtt 
Phe Gly Pro Val 
400 

att aac atg gcg 
lie Asn Met Ala 

acc tct aat att 
Thr Ser Asn lie 
435 

ggt acg gtc tgg 
Gly Thr Val Trp 
450 

ttc ggt ggg ttc 
Phe Gly Gly Phe 
465 



caa ggt gca caa 
Gin Gly Ala Gin 
340 

gtt gac att ggt 
Val Asp lie Gly 

aga tta ggt age 
Arg Leu Gly Ser 
375 

gtt aag gaa gac 
Val Lys Glu Asp 
390 

gtc act gta acc 
Val Thr Val Thr 
405 

aac gat tct gaa 
Asn Asp Ser Glu 
420 

aat acc gee tta 
Asn Thr Ala Leu 

ata aac act tat 
lie Asn Thr Tyr 
455 

aat gca tct ggt 
Asn Ala Ser Gly 
470 



acc tct 1059 
Thr Ser 
345 

aag aat 1107 

Lys Asn 

360 

aag ggt 1155 
Lys Gly 

atg aga 1203 
Met Arg 

aaa ttc 1251 
Lys Phe 

tac ggg 1299 
Tyr Gly 
425 

aaa gtg 1347 

Lys Val 

440 

aac gat 1395 
Asn Asp 

ttg ggc 1443 
Leu Gly 
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agg gaa atg tct gtt gat get tta caa aac tac ttg caa gtt aaa gcg 1491 
Arg Glu Met Ser Val Asp Ala Leu Gin Asn Tyr Leu Gin Val Lys Ala 
475 480 485 

gtc cgt gec aaa ttg gac gagtaagagc tegaattege 1529 
5 Val Arg Ala Lys Leu Asp 
490 495 



<210> 2 

<211> 495 

<212> PRT 

10 <213> Saccharomyces cerevisiae 

<400> 2 

Met Ser His Leu Pro Met Thr Val Pro lie Lys Leu Pro Asn Gly Leu 
15 10 15 

Glu Tyr Glu Gin Pro Thr Gly Leu Phe lie Asn Asn Lys Phe Val Pro 
15 20 25 30 

Ser Lys Gin Asn Lys Thr Phe Glu Val lie Asn Pro Ser Thr Glu Glu 
35 40 45 

Glu lie Cys His lie Tyr Glu Gly Arg Glu Asp Asp Val Glu Glu Ala 
50 55 60 

20 Val Gin Ala Ala Asp Arg Ala Phe Ser Asn Gly Ser Trp Asn Gly lie 
65 70 75 80 

Asp Pro lie Asp Arg Gly Lys Ala Leu Tyr Arg Leu Ala Glu Leu lie 
85 90 95 

Glu Gin Asp Lys Asp Val lie Ala Ser lie Glu Thr Leu Asp Asn Gly 
25 100 105 110 

Lys Ala lie Ser Ser Ser Arg Gly Asp Val Asp Leu Val lie Asn Tyr 
115 120 125 
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Leu Lys Ser Ser Ala Gly Phe Ala Asp Lys He Asp Gly Arg Met He 
130 135 140 

Asp Thr Gly Arg Thr His Phe Ser Tyr Thr Lys Arg Gin Pro Leu Gly 
145 150 155 160 

5 Val Cys Gly Gin He He Pro Trp Asn Phe Pro Leu Leu Met Trp Ala 

165 170 175 

Trp Lys He Ala Pro Ala Leu Val Thr Gly Asn Thr Val Val Leu Lys 
180 185 190 

Thr Ala Glu Ser Thr Pro Leu Ser Ala Leu Tyr Val Ser Lys Tyr He 
10 195 200 205 

Pro Gin Ala Gly He Pro Pro Gly Val He Asn He Val Ser Gly Phe 
210 215 220 

Gly Lys He Val Val Glu Ala He Thr Asn His Pro Lys He Lys Lys 
225 230 235 240 

15 Val Ala Phe Thr Gly Ser Thr Ala Thr Gly Arg His He Tyr Gin Ser 

245 250 255 

Ala Ala Ala Gly Leu Lys Lys Val Thr Leu Glu Leu Gly Gly Lys Ser 
260 265 270 

Pro Asn He Val Phe Ala Asp Ala Glu Leu Lys Lys Ala Val Gin Asn 
20 275 280 285 

He He Leu Gly He Tyr Tyr Asn Ser Gly Glu Val Cys Cys Ala Gly 
290 295 300 

Ser Arg Val Tyr Val Glu Glu Ser He Tyr Asp Lys Phe He Glu Glu 
305 310 315 320 

25 Phe Lys Ala Ala Ser Glu Ser He Lys Val Gly Asp Pro Phe Asp Glu 

325 330 335 
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Ser Thr Phe Gin Gly Ala Gin Thr Ser Gin Met Gin Leu Asn Lys lie 
340 345 350 



Leu Lys Tyr Val Asp lie Gly Lys Asn Glu Gly Ala Thr Leu He Thr 
355 360 365 

5 Gly Gly Glu Arg Leu Gly Ser Lys Gly Tyr Phe He Lys Pro Thr Val 
370 375 380 

Phe Gly Asp Val Lys Glu Asp Met Arg He Val Lys Glu Glu He Phe 
385 390 395 400 

Gly Pro Val Val Thr Val Thr Lys Phe Lys Ser Ala Asp Glu Val He 
10 405 410 415 

Asn Met Ala Asn Asp Ser Glu Tyr Gly Leu Ala Ala Gly He His Thr 
420 425 430 

Ser Asn He Asn Thr Ala Leu Lys Val Ala Asp Arg Val Asn Ala Gly 
435 440 445 

15 Thr Val Trp He Asn Thr Tyr Asn Asp Phe His His Ala Val Pro Phe 
450 455 460 

Gly Gly Phe Asn Ala Ser Gly Leu Gly Arg Glu Met Ser Val Asp Ala 
465 470 475 480 

Leu Gin Asn Tyr Leu Gin Val Lys Ala Val Arg Ala Lys Leu Asp 
20 485 490 495 



<210> 3 

<211> 1541 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (22) . . (1521) 

<400> 3 

5 gcggtaccaa ggagatatca t atg tea gec gec gec acc cag gec gtg cct 51 

Met Ser Ala Ala Ala Thr Gin Ala Val Pro 
15 10 

gec ccc aac cag cag ccc gag gtc ttc tgc aac cag att ttc ata aac 99 
Ala Pro Asn Gin Gin Pro Glu Val Phe Cys Asn Gin lie Phe lie Asn 
10 15 20 25 

aat gaa tgg cac gat gec gtc age agg aaa aca ttc ccc acc gtc aat 147 
Asn Glu Trp His Asp Ala Val Ser Arg Lys Thr Phe Pro Thr Val Asn 
30 35 40 

ccg tec act gga gag gtc ate tgt cag gta get gaa ggg gac aag gaa 195 
15 Pro Ser Thr Gly Glu Val lie Cys Gin Val Ala Glu Gly Asp Lys Glu 
45 50 55 

gat gtg gac aag gca cgt gaa ggc cgc ccg ggc gec ttc cag ctg ggc 243 
Asp Val Asp Lys Ala Arg Glu Gly Arg Pro Gly Ala Phe Gin Leu Gly 
60 65 70 

20 tea cct tgg cgc cgc atg gac gca tea cac age ggc egg ctg ctg aac 291 
Ser Pro Trp Arg Arg Met Asp Ala Ser His Ser Gly Arg Leu Leu Asn 
75 80 85 90 

cgc ctg gec gat ctg ate gag egg gac egg acc tac ctg gcg gec ttg 33 9 
Arg Leu Ala Asp Leu lie Glu Arg Asp Arg Thr Tyr Leu Ala Ala Leu 
25 95 100 105 

gag acc ctg gac aat ggc aag ccc tat gtc ate tec tac ctg gtg gat 38 7 
Glu Thr Leu Asp Asn Gly Lys Pro Tyr Val lie Ser Tyr Leu Val Asp 
110 115 120 
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ttg gac atg gtc etc aaa tgt etc egg tat tat gec ggc tgg get gat 43 5 
Leu Asp Met Val Leu Lys Cys Leu Arg Tyr Tyr Ala Gly Trp Ala Asp 
125 130 135 

aag tac cac ggg aaa acc ate ccc att gac gga gac ttc ttc age tac 483 
5 Lys Tyr His Gly Lys Thr He Pro He Asp Gly Asp Phe Phe Ser Tyr 
140 145 150 

aca cgc cat gaa cct gtg ggg gtg tgc ggg cag ate att ccg tgg aat 531 
Thr Arg His Glu Pro Val Gly Val Cys Gly Gin He He Pro Trp Asn 
155 160 165 170 

10 ttc ccg etc ctg atg caa gca tgg aag ctg ggc cca gee ttg gca act 579 
^ y Phe Pro Leu Leu Met Gin Ala Trp Lys Leu Gly Pro Ala Leu Ala Thr 

175 180 185 

gga aac gtg gtt gtg atg aag gta get gag cag aca ccc etc acc gee 62 7 
Gly Asn Val Val Val Met Lys Val Ala Glu Gin Thr Pro Leu Thr Ala 
15 190 195 200 

etc tat gtg gee aac ctg ate aag gag get ggc ttt ccc cct ggt gtg 675 
Leu Tyr Val Ala Asn Leu He Lys Glu Ala Gly Phe Pro Pro Gly Val 
205 210 215 

( j gtc aac att gtg cct gga ttt ggc ccc acg get ggg gec gec att gee 723 



20 Val Asn He Val Pro Gly Phe Gly Pro Thr Ala Gly Ala Ala He Ala 

220 225 230 

tec cat gag gat gtg gac aaa gtg gca ttc aca ggc tec act gag att 771 
Ser His Glu Asp Val Asp Lys Val Ala Phe Thr Gly Ser Thr Glu He 
235 240 245 250 

25 ggc cgc gta ate cag gtt get get ggg age age aac etc aag aga gtg 819 
Gly Arg Val He Gin Val Ala Ala Gly Ser Ser Asn Leu Lys Arg Val 
255 260 265 
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acc ttg gag ctg ggg ggg aag age ccc aac ate ate atg tea gat gee 867 
Thr Leu Glu Leu Gly Gly Lys Ser Pro Asn He He Met Ser Asp Ala 
270 275 280 

gat atg gat tgg gee gtg gaa cag gee cac ttc gee ctg ttc ttc aac 915 
5 Asp Met Asp Trp Ala Val Glu Gin Ala His Phe Ala Leu Phe Phe Asn 
285 290 295 

cag ggc cag tgc tgc tgt gee ggc tec egg acc ttc gtg cag gag gac 963 
Gin Gly Gin Cys Cys Cys Ala Gly Ser Arg Thr Phe Val Gin Glu Asp 
300 305 310 

10 ate tat gat gag ttt gtg gtg egg age gtt gee egg gec aag tct egg 1011 
He Tyr Asp Glu Phe Val Val Arg Ser Val Ala Arg Ala Lys Ser Arg 
315 320 325 330 

gtg gtc ggg aac ccc ttt gat age aag acc gag cag ggg ccg cag gtg 1059 
Val Val Gly Asn Pro Phe Asp Ser Lys Thr Glu Gin Gly Pro Gin Val 
15 335 340 345 

gat gaa act cag ttt aag aag ate etc ggc tac ate aac acg ggg aag 1107 
Asp Glu Thr Gin Phe Lys Lys He Leu Gly Tyr He Asn Thr Gly Lys 
350 355 360 

caa gag ggg gcg aag ctg ctg tgt ggt ggg ggc att get get gac cgt 1155 
20 Gin Glu Gly Ala Lys Leu Leu Cys Gly Gly Gly He Ala Ala Asp Arg 
365 370 375 

ggt tac ttc ate cag ccc act gtg ttt gga gat gtg cag gat ggc atg 1203 
Gly Tyr Phe He Gin Pro Thr Val Phe Gly Asp Val Gin Asp Gly Met 
380 385 390 

25 acc ate gee aag gag gag ate ttc ggg cca gtg atg cag ate ctg aag 12 51 
Thr He Ala Lys Glu Glu He Phe Gly Pro Val Met Gin He Leu Lys 
395 400 405 410 
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ttc aag acc at a gag gag gtt gtt ggg aga gcc aac aat tec acg tac 12 99 
Phe Lys Thr lie Glu Glu Val Val Gly Arg Ala Asn Asn Ser Thr Tyr 
415 420 425 

ggg ctg gcc gca get gtc ttc aca aag gat ttg gac aag gcc aat tac 1347 
5 Gly Leu Ala Ala Ala Val Phe Thr Lys Asp Leu Asp Lys Ala Asn Tyr 
430 435 440 

ctg tec cag gcc etc cag gcg ggc act gtg tgg gtc aac tgc tat gat 1395 
Leu Ser Gin Ala Leu Gin Ala Gly Thr Val Trp Val Asn Cys Tyr Asp 
445 450 455 

10 gtg ttt gga gcc cag tea ccc ttt ggt ggc tac aag atg teg ggg agt 1443 
Val Phe Gly Ala Gin Ser Pro Phe Gly Gly Tyr Lys Met Ser Gly Ser 
460 465 470 

ggc egg gag ttg ggc gag tac ggg ctg cag gca tac act gaa gtg aaa 1491 
Gly Arg Glu Leu Gly Glu Tyr Gly Leu Gin Ala Tyr Thr Glu Val Lys 
15 475 480 485 490 

act gtc aca gtc aaa gtg cct cag aag aac tcataagagc tegaattege 1541 
Thr Val Thr Val Lys Val Pro Gin Lys Asn 
495 500 



<210> 4 
20 <211> 500 
<212> PRT 
<213> Homo sapiens 



<400> 4 

Met Ser Ala Ala Ala Thr Gin Ala Val Pro Ala Pro Asn Gin Gin Pro 
25 1 5 10 15 

Glu Val Phe Cys Asn Gin lie Phe He Asn Asn Glu Trp His Asp Ala 
20 25 30 
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Val Ser Arg Lys Thr Phe Pro Thr Val Asn Pro Ser Thr Gly Glu Val 
35 40 45 

lie Cys Gin Val Ala Glu Gly Asp Lys Glu Asp Val Asp Lys Ala Arg 
50 55 60 

5 Glu Gly Arg Pro Gly Ala Phe Gin Leu Gly Ser Pro Trp Arg Arg Met 
65 70 75 80 

Asp Ala Ser His Ser Gly Arg Leu Leu Asn Arg Leu Ala Asp Leu lie 
85 90 95 

Glu Arg Asp Arg Thr Tyr Leu Ala Ala Leu Glu Thr Leu Asp Asn Gly 
10 100 105 110 

Lys Pro Tyr Val He Ser Tyr Leu Val Asp Leu Asp Met Val Leu Lys 
115 120 125 

Cys Leu Arg Tyr Tyr Ala Gly Trp Ala Asp Lys Tyr His Gly Lys Thr 
130 135 140 

15 He Pro He Asp Gly Asp Phe Phe Ser Tyr Thr Arg His Glu Pro Val 
145 150 155 160 

Gly Val Cys Gly Gin He He Pro Trp Asn Phe Pro Leu Leu Met Gin 
165 170 175 

Ala Trp Lys Leu Gly Pro Ala Leu Ala Thr Gly Asn Val Val Val Met 
20 180 185 190 

Lys Val Ala Glu Gin Thr Pro Leu Thr Ala Leu Tyr Val Ala Asn Leu 
195 200 205 

He Lys Glu Ala Gly Phe Pro Pro Gly Val Val Asn He Val Pro Gly 
210 215 220 
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Phe Gly Pro Thr Ala Gly Ala Ala lie Ala Ser His Glu Asp Val Asp 
225 230 235 240 

Lys Val Ala Phe Thr Gly Ser Thr Glu lie Gly Arg Val lie Gin Val 
245 250 255 

5 Ala Ala Gly Ser Ser Asn Leu Lys Arg Val Thr Leu Glu Leu Gly Gly 
260 265 270 

Lys Ser Pro Asn lie lie Met Ser Asp Ala Asp Met Asp Trp Ala Val 
275 280 285 

Glu Gin Ala His Phe Ala Leu Phe Phe Asn Gin Gly Gin Cys Cys Cys 
10 290 295 300 

Ala Gly Ser Arg Thr Phe Val Gin Glu Asp lie Tyr Asp Glu Phe Val 
305 310 315 320 

Val Arg Ser Val Ala Arg Ala Lys Ser Arg Val Val Gly Asn Pro Phe 
325 330 335 

15 Asp Ser Lys Thr Glu Gin Gly Pro Gin Val Asp Glu Thr Gin Phe Lys 
340 345 350 

Lys lie Leu Gly Tyr lie Asn Thr Gly Lys Gin Glu Gly Ala Lys Leu 
355 360 365 

Leu Cys Gly Gly Gly lie Ala Ala Asp Arg Gly Tyr Phe lie Gin Pro 
20 370 375 380 

Thr Val Phe Gly Asp Val Gin Asp Gly Met Thr lie Ala Lys Glu Glu 
385 390 395 400 

lie Phe Gly Pro Val Met Gin lie Leu Lys Phe Lys Thr lie Glu Glu 
405 410 415 
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Val Val Gly Arg Ala Asn Asn Ser Thr Tyr Gly Leu Ala Ala Ala Val 
420 425 430 

Phe Thr Lys Asp Leu Asp Lys Ala Asn Tyr Leu Ser Gin Ala Leu Gin 
435 440 445 

5 Ala Gly Thr Val Trp Val Asn Cys Tyr Asp Val Phe Gly Ala Gin Ser 
450 455 460 

Pro Phe Gly Gly Tyr Lys Met Ser Gly Ser Gly Arg Glu Leu Gly Glu 
465 470 475 480 

Tyr Gly Leu Gin Ala Tyr Thr Glu Val Lys Thr Val Thr Val Lys Val 
10 485 490 495 

Pro Gin Lys Asn 
500 



<210> 5 

<211> 1512 

15 <212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (37) . . (1473) 

20 <400> 5 

gctaccatgg cttaaccggt accaaggaga tatcat atg tea gta ccc gtt caa 54 

Met Ser Val Pro Val Gin 
1 5 

cat cct atg tat ate gat gga cag ttt gtt acc tgg cgt gga gac gca 102 

25 His Pro Met Tyr lie Asp Gly Gin Phe Val Thr Trp Arg Gly Asp Ala 
10 15 20 
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tgg att gat gtg gta aac cct get aca gag get gtc att tec cgc ata 150 
Trp lie Asp Val Val Asn Pro Ala Thr Glu Ala Val lie Ser Arg lie 
25 30 35 

ccc gat ggt cag gee gag gat gec cgt aag gca ate gat gca gca gaa 198 
5 Pro Asp Gly Gin Ala Glu Asp Ala Arg Lys Ala He Asp Ala Ala Glu 
40 45 50 

cgt gca caa cca gaa tgg gaa gcg ttg cct get att gaa cgc gee agt 246 
Arg Ala Gin Pro Glu Trp Glu Ala Leu Pro Ala He Glu Arg Ala Ser 
55 60 65 70 

10 tgg ttg cgc aaa ate tec gec ggg ate cgc gaa cgc gec agt gaa ate 294 
Trp Leu Arg Lys lie Ser Ala Gly He Arg Glu Arg Ala Ser Glu He 
75 80 85 

agt gcg ctg att gtt gaa gaa ggg ggc aag ate cag cag ctg get gaa 342 
Ser Ala Leu He Val Glu Glu Gly Gly Lys He Gin Gin Leu Ala Glu 
15 90 95 100 

gtc gaa gtg get ttt act gee gac tat ate gat tac atg gcg gag tgg 3 90 
Val Glu Val Ala Phe Thr Ala Asp Tyr He Asp Tyr Met Ala Glu Trp 
105 110 115 

gca egg cgt tac gag ggc gag att att caa age gat cgt cca gga gaa 438 
20 Ala Arg Arg Tyr Glu Gly Glu He He Gin Ser Asp Arg Pro Gly Glu 
120 125 130 

aat att ctt ttg ttt aaa cgt gcg ctt ggt gtg act ace ggc att ctg 486 
Asn He Leu Leu Phe Lys Arg Ala Leu Gly Val Thr Thr Gly He Leu 
135 140 145 150 

25 ccg tgg aac ttc ccg ttc ttc etc att gec cgc aaa atg get ccc get 534 
Pro Trp Asn Phe Pro Phe Phe Leu He Ala Arg Lys Met Ala Pro Ala 
155 160 165 
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ctt ttg acc ggt aat acc ate gtc att aaa cct agt gaa ttt acg aca 582 
Leu Leu Thr Gly Asn Thr lie Val lie Lys Pro Ser Glu Phe Thr Thr 
170 175 180 

aac aat gcg att gca ttc gec aaa ate gtc gat gaa ata ggc ctt ccg 630 
5 Asn Asn Ala lie Ala Phe Ala Lys lie Val Asp Glu lie Gly Leu Pro 
185 190 195 

cgc ggc gtg ttt aac ctt gta ctg ggg cgt ggt gaa acc gtt ggg caa 678 
Arg Gly Val Phe Asn Leu Val Leu Gly Arg Gly Glu Thr Val Gly Gin 
200 205 210 

10 gaa ctg gcg ggt aac cca aag gtc gca atg gtc agt atg aca ggc age 726 
Glu Leu Ala Gly Asn Pro Lys Val Ala Met Val Ser Met Thr Gly Ser 
215 220 225 230 

gtc tct gca ggt gag aag ate atg gcg act gcg gcg aaa aac ate acc 774 
Val Ser Ala Gly Glu Lys lie Met Ala Thr Ala Ala Lys Asn lie Thr 
15 235 240 245 

aaa gtg tgt ctg gaa ttg ggg ggt aaa gca cca get ate gta atg gac 822 
Lys Val Cys Leu Glu Leu Gly Gly Lys Ala Pro Ala lie Val Met Asp 
250 255 260 

gat gec gat ctt gaa ctg gca gtc aaa gec ate gtt gat tea cgc gtc 870 
20 Asp Ala Asp Leu Glu Leu Ala Val Lys Ala lie Val Asp Ser Arg Val 
265 270 275 

att aat agt ggg caa gtg tgt aac tgt gca gaa cgt gtt tat gta cag 918 
lie Asn Ser Gly Gin Val Cys Asn Cys Ala Glu Arg Val Tyr Val Gin 
280 285 290 

25 aaa ggc att tat gat cag ttc gtc aat egg ctg ggt gaa gcg atg cag 966 
Lys Gly lie Tyr Asp Gin Phe Val Asn Arg Leu Gly Glu Ala Met Gin 
295 300 305 310 
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gcg gtt caa ttt ggt aac ccc get gaa cgc aac gac att gcg atg ggg 1014 
Ala Val Gin Phe Gly Asn Pro Ala Glu Arg Asn Asp lie Ala Met Gly 
315 320 325 

ccg ttg att aac gec gcg gcg ctg gaa agg gtc gag caa aaa gtg gcg 1062 
5 Pro Leu He Asn Ala Ala Ala Leu Glu Arg Val Glu Gin Lys Val Ala 
330 335 340 

cgc gca gta gaa gaa ggg gcg aga gtg gcg ttc ggt ggc aaa gcg gta 1110 
Arg Ala Val Glu Glu Gly Ala Arg Val Ala Phe Gly Gly Lys Ala Val 
345 350 355 

10 gag ggg aaa gga tat tat tat ccg ccg aca ttg ctg ctg gat gtt cgc 1158 
Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr Leu Leu Leu Asp Val Arg 
360 365 370 

cag gaa atg teg att atg cat gag gaa acc ttt ggc ccg gtg ctg cca 1206 
Gin Glu Met Ser He Met His Glu Glu Thr Phe Gly Pro Val Leu Pro 
15 375 380 385 390 

gtt gtc gca ttt gac acg ctg gaa gat get ate tea atg get aat gac 1254 
Val Val Ala Phe Asp Thr Leu Glu Asp Ala He Ser Met Ala Asn Asp 
395 400 405 

agt gat tac ggc ctg acc tea tea ate tat acc caa aat ctg aac gtc 1302 
20 Ser Asp Tyr Gly Leu Thr Ser Ser He Tyr Thr Gin Asn Leu Asn Val 
410 415 420 

gcg atg aaa gee att aaa ggg ctg aag ttt ggt gaa act tac ate aac 1350 
Ala Met Lys Ala He Lys Gly Leu Lys Phe Gly Glu Thr Tyr He Asn 
425 430 435 

25 cgt gaa aac ttc gaa get atg caa ggc ttc cac gee gga tgg cgt aaa 13 98 
Arg Glu Asn Phe Glu Ala Met Gin Gly Phe His Ala Gly Trp Arg Lys 
440 445 450 
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tec ggt att ggc ggc gca gat ggt aaa cat ggc ttg cat gga tat ctg 1446 
Ser Gly He Gly Gly Ala Asp Gly Lys His Gly Leu His Gly Tyr Leu 
455 460 465 470 

cag acc cag gtg gtt tat tta cag tct taagagctcg aattcccgtc 1493 
5 Gin Thr Gin Val Val Tyr Leu Gin Ser 

475 

gaeggctcta gaetcgageg 1513 



<210> 6 

<211> 479 

10 <212> PRT 

<213> Escherichia coli 

<400> 6 

Met Ser Val Pro Val Gin His Pro Met Tyr He Asp Gly Gin Phe Val 
1 5 10 15 

15 Thr Trp Arg Gly Asp Ala Trp He Asp Val Val Asn Pro Ala Thr Glu 
20 25 30 

Ala Val He Ser Arg He Pro Asp Gly Gin Ala Glu Asp Ala Arg Lys 
35 40 45 

Ala He Asp Ala Ala Glu Arg Ala Gin Pro Glu Trp Glu Ala Leu Pro 
20 50 55 60 

Ala He Glu Arg Ala Ser Trp Leu Arg Lys He Ser Ala Gly He Arg 
65 70 75 80 

Glu Arg Ala Ser Glu He Ser Ala Leu He Val Glu Glu Gly Gly Lys 
85 90 95 

25 He Gin Gin Leu Ala Glu Val Glu Val Ala Phe Thr Ala Asp Tyr He 
100 105 110 
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Asp Tyr Met Ala Glu Trp Ala Arg Arg Tyr Glu Gly Glu lie lie Gin 
115 120 125 

Ser Asp Arg Pro Gly Glu Asn lie Leu Leu Phe Lys Arg Ala Leu Gly 
130 135 140 

5 Val Thr Thr Gly lie Leu Pro Trp Asn Phe Pro Phe Phe Leu lie Ala 
145 150 155 160 

Arg Lys Met Ala Pro Ala Leu Leu Thr Gly Asn Thr lie Val lie Lys 
165 170 175 

Pro Ser Glu Phe Thr Thr Asn Asn Ala He Ala Phe Ala Lys He Val 
10 180 185 190 

Asp Glu He Gly Leu Pro Arg Gly Val Phe Asn Leu Val Leu Gly Arg 
195 200 205 

Gly Glu Thr Val Gly Gin Glu Leu Ala Gly Asn Pro Lys Val Ala Met 
210 215 220 

15 Val Ser Met Thr Gly Ser Val Ser Ala Gly Glu Lys lie Met Ala Thr 
225 230 235 240 

Ala Ala Lys Asn lie Thr Lys Val Cys Leu Glu Leu Gly Gly Lys Ala 
245 250 255 

Pro Ala He Val Met Asp Asp Ala Asp Leu Glu Leu Ala Val Lys Ala 
20 260 265 270 

He Val Asp Ser Arg Val He Asn Ser Gly Gin Val Cys Asn Cys Ala 
275 280 285 

Glu Arg Val Tyr Val Gin Lys Gly He Tyr Asp Gin Phe Val Asn Arg 
290 295 300 
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Leu Gly Glu Ala Met Gin Ala Val Gin Phe Gly Asn Pro Ala Glu Arg 
305 310 315 320 



Asn Asp lie Ala Met Gly Pro Leu lie Asn Ala Ala Ala Leu Glu Arg 
325 330 335 

5 Val Glu Gin Lys Val Ala Arg Ala Val Glu Glu Gly Ala Arg Val Ala 
340 345 350 

Phe Gly Gly Lys Ala Val Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr 
355 360 365 

Leu Leu Leu Asp Val Arg Gin Glu Met Ser lie Met His Glu Glu Thr 
10 370 375 380 

Phe Gly Pro Val Leu Pro Val Val Ala Phe Asp Thr Leu Glu Asp Ala 
385 390 395 400 

lie Ser Met Ala Asn Asp Ser Asp Tyr Gly Leu Thr Ser Ser lie Tyr 
405 410 415 

15 Thr Gin Asn Leu Asn Val Ala Met Lys Ala lie Lys Gly Leu Lys Phe 
420 425 430 

Gly Glu Thr Tyr lie Asn Arg Glu Asn Phe Glu Ala Met Gin Gly Phe 
435 440 445 

His Ala Gly Trp Arg Lys Ser Gly lie Gly Gly Ala Asp Gly Lys His 
20 450 455 460 

Gly Leu His Gly Tyr Leu Gin Thr Gin Val Val Tyr Leu Gin Ser 
465 470 475 



-20- 



■Ml :«»,P 



JL •>!...>; -n;™ 



p c t /* u s a o s e 3 e 7' e 



<210> 7 
<211> 1574 
<212> DNA 

<213> Escherichia coli 



5 <220> 

<221> CDS 

<222> (22) . . (1557) 



<400> 7 

gcggtaccaa ggaggtatca t atg acc aat aat ccc cct tea gca cag att 51 
10 Met Thr Asn Asn Pro Pro Ser Ala Gin lie 

15 10 



aag ccc ggc gag tat ggt ttc ccc etc aag tta aaa gec cgc tat gac 99 
Lys Pro Gly Glu Tyr Gly Phe Pro Leu Lys Leu Lys Ala Arg Tyr Asp 
15 20 25 



15 aac ttt att ggc ggc gaa tgg gta gec cct gec gac ggc gag tat tac 147 
Asn Phe lie Gly Gly Glu Trp Val Ala Pro Ala Asp Gly Glu Tyr Tyr 
30 35 40 



cag aat ctg acg ccg gtg acc ggg cag ctg ctg tgc gaa gtg gcg tct 195 
Gin Asn Leu Thr Pro Val Thr Gly Gin Leu Leu Cys Glu Val Ala Ser 
20 45 50 55 



teg ggc aaa cga gac ate gat ctg 
Ser Gly Lys Arg Asp lie Asp Leu 
60 65 

aaa gat aaa tgg gcg cac acc teg 
25 Lys Asp Lys Trp Ala His Thr Ser 
75 80 



gcg ctg gat get gcg cac aaa gtg 243 
Ala Leu Asp Ala Ala His Lys Val 
70 

gtg cag gat cgt gcg gcg att ctg 2 91 
Val Gin Asp Arg Ala Ala lie Leu 
85 90 



ttt aag att gec gat cga atg gaa caa aac etc gag ctg tta gcg aca 33 9 
Phe Lys lie Ala Asp Arg Met Glu Gin Asn Leu Glu Leu Leu Ala Thr 
95 100 105 
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get gaa acc tgg gat aac ggc aaa ccc att cgc gaa acc agt get gcg 387 
Ala Glu Thr Trp Asp Asn Gly Lys Pro lie Arg Glu Thr Ser Ala Ala 
110 115 120 

gat gta ccg ctg gcg att gac cat ttc cgc tat ttc gec teg tgt att 43 5 
5 Asp Val Pro Leu Ala lie Asp His Phe Arg Tyr Phe Ala Ser Cys lie 
125 130 135 

egg gcg cag gaa ggt ggg ate agt gaa gtt gat age gaa acc gtg gee 483 
Arg Ala Gin Glu Gly Gly lie Ser Glu Val Asp Ser Glu Thr Val Ala 
140 145 150 

10 tat cat ttc cat gaa ccg tta ggc gtg gtg ggg cag att ate ccg tgg 531 
Tyr His Phe His Glu Pro Leu Gly Val Val Gly Gin lie lie Pro Trp 
155 160 165 170 

aac ttc ccg ctg ctg atg gcg age tgg aaa atg get ccc gcg ctg gcg 579 
Asn Phe Pro Leu Leu Met Ala Ser Trp Lys Met Ala Pro Ala Leu Ala 
15 175 180 185 

9 C S 99 c aac tgt gtg gtg ctg aaa ccc gca cgt ctt acc ccg ctt tct 627 
Ala Gly Asn Cys Val Val Leu Lys Pro Ala Arg Leu Thr Pro Leu Ser 
190 195 200 

gta ctg ctg eta atg gaa att gtc ggt gat tta ctg ccg ccg ggc gtg 675 
20 Val Leu Leu Leu Met Glu lie Val Gly Asp Leu Leu Pro Pro Gly Val 
205 210 215 

gtg aac gtg gtc aat ggc gca ggt ggg gta att ggc gaa tat ctg gcg 723 
Val Asn Val Val Asn Gly Ala Gly Gly Val He Gly Glu Tyr Leu Ala 
220 225 230 

25 acc teg aaa cgc ate gee aaa gtg gcg ttt acc ggc tea acg gaa gtg 771 
Thr Ser Lys Arg He Ala Lys Val Ala Phe Thr Gly Ser Thr Glu Val 
235 240 245 250 
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ggc caa caa att atg caa tac gca acg caa aac att att ccg gtg acg 819 
Gly Gin Gin lie Met Gin Tyr Ala Thr Gin Asn He He Pro Val Thr 
255 260 265 

ctg gag ttg ggc ggt aag teg cca aat ate gtc ttt get gat gtg atg 867 
5 Leu Glu Leu Gly Gly Lys Ser Pro Asn He Val Phe Ala Asp Val Met 
270 275 280 



gat gaa gaa gat gee ttt ttc gat aaa gcg ctg gaa ggc ttt gca ctg 915 
Asp Glu Glu Asp Ala Phe Phe Asp Lys Ala Leu Glu Gly Phe Ala Leu 
285 290 295 

ttt gec ttt aac cag ggc gaa gtt tgc acc tgt ccg agt cgt get tta 963 
Phe Ala Phe Asn Gin Gly Glu Val Cys Thr Cys Pro Ser Arg Ala Leu 
300 305 310 



gtg cag gaa tct ate tac gaa cgc 
Val Gin Glu Ser 11^ Tyr Glu Arg 
15 315 320 

gtc gaa age att cgt age ggt aac 
Val Glu Ser He Arg Ser Gly Asn 
335 



ttt atg gaa cgc gee ate cgc cgt 1011 
Phe Met Glu Arg Ala He Arg Arg 
325 330 

ccg etc gac age gtg acg caa atg 1059 
Pro Leu Asp Ser Val Thr Gin Met 
340 345 



ggc gcg cag gtt tct cac ggg caa ctg gaa acc ate etc aac tac att 1107 
20 Gly Ala Gin Val Ser His Gly Gin Leu Glu Thr He Leu Asn Tyr He 
350 355 360 



25 



gat ate ggt aaa aaa gag ggc get gac gtg etc aca ggc ggg egg cgc 1155 
Asp He Gly Lys Lys Glu Gly Ala Asp Val Leu Thr Gly Gly Arg Arg 
365 370 375 

aag ctg ctg gaa ggt gaa ctg aaa gac ggc tac tac etc gaa ccg acg 1203 
Lys Leu Leu Glu Gly Glu Leu Lys Asp Gly Tyr Tyr Leu Glu Pro Thr 
380 385 390 
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att ctg ttt ggt cag aac aat atg egg gtg ttc cag gag gag att ttt 1251 
lie Leu Phe Gly Gin Asn Asn Met Arg Val Phe Gin Glu Glu lie Phe 
395 400 405 410 

ggc ccg gtg ctg gcg gtg acc acc ttc aaa acg atg gaa gaa gcg ctg 12 99 
5 Gly Pro Val Leu Ala Val Thr Thr Phe Lys Thr Met Glu Glu Ala Leu 

415 420 425 

gag ctg gcg aac gat acg caa tat ggc ctg ggc gcg ggc gtc tgg age 1347 
Glu Leu Ala Asn Asp Thr Gin Tyr Gly Leu Gly Ala Gly Val Trp Ser 
430 435 440 

10 cgc aac ggt aat ctg gec tat aag atg ggg cgc ggc ata cag get ggg 13 95 
Arg Asn Gly Asn Leu Ala Tyr Lys Met Gly Arg Gly He Gin Ala Gly 
445 450 455 

cgc gtg tgg acc aac tgt tat cac get tac ccg gca cat gcg gcg ttt 1443 
Arg Val Trp Thr Asn Cys Tyr His Ala Tyr Pro Ala His Ala Ala Phe 
15 460 465 470 

ggt ggc tac aaa caa tea ggt ate ggt cgc gaa acc cac aag atg atg 1491 
Gly Gly Tyr Lys Gin Ser Gly He Gly Arg Glu Thr His Lys Met Met 
475 480 485 490 

ctg gag cat tac cag caa acc aag tgc ctg ctg gtg age tac teg gat 1539 
20 Leu Glu His Tyr Gin Gin Thr Lys Cys Leu Leu Val Ser Tyr Ser Asp 

495 500 505 

aaa ccg ttg ggg ctg ttc taagagctcg aattege 1574 
Lys Pro Leu Gly Leu Phe 
510 
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<210> 8 
<211> 512 
<212> PRT 

<213> Escherichia coli 



5 <400> 8 

Met Thr Asn Asn Pro Pro Ser Ala Gin lie Lys Pro Gly Glu Tyr Gly 
15 10 15 

Phe Pro Leu Lys Leu Lys Ala Arg Tyr Asp Asn Phe lie Gly Gly Glu 
20 25 30 

10 Trp Val Ala Pro Ala Asp Gly Glu Tyr Tyr Gin Asn Leu Thr Pro Val 
35 40 45 

Thr Gly Gin Leu Leu Cys Glu Val Ala Ser Ser Gly Lys Arg Asp lie 
50 55 60 

Asp Leu Ala Leu Asp Ala Ala His Lys Val Lys Asp Lys Trp Ala His 
15 65 70 75 80 

Thr Ser Val Gin Asp Arg Ala Ala He Leu Phe Lys He Ala Asp Arg 
85 90 95 

Met Glu Gin Asn Leu Glu Leu Leu Ala Thr Ala Glu Thr Trp Asp Asn 
100 105 110 

20 Gly Lys Pro He Arg Glu Thr Ser Ala Ala Asp Val Pro Leu Ala He 
115 120 125 

Asp His Phe Arg Tyr Phe Ala Ser Cys He Arg Ala Gin Glu Gly Gly 
130 135 140 

He Ser Glu Val Asp Ser Glu Thr Val Ala Tyr His Phe His Glu Pro 
25 145 150 155 160 
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Leu Gly Val Val Gly Gin He He Pro Trp Asn Phe Pro Leu Leu Met 
165 170 175 



Ala Ser Trp Lys Met Ala Pro Ala Leu Ala Ala Gly Asn Cys Val Val 
180 185 190 

5 Leu Lys Pro Ala Arg Leu Thr Pro Leu Ser Val Leu Leu Leu Met Glu 
195 200 205 

He Val Gly Asp Leu Leu Pro Pro Gly Val Val Asn Val Val Asn Gly 
210 215 220 

Ala Gly Gly Val He Gly Glu Tyr Leu Ala Thr Ser Lys Arg He Ala 
10 225 230 235 240 

Lys Val Ala Phe Thr Gly Ser Thr Glu Val Gly Gin Gin He Met Gin 
245 250 255 

Tyr Ala Thr Gin Asn He He Pro Val Thr Leu Glu Leu Gly Gly Lys 
260 265 270 

15 Ser Pro Asn He Val Phe Ala Asp Val Met Asp Glu Glu Asp Ala Phe 
275 280 285 

Phe Asp Lys Ala Leu Glu Gly Phe Ala Leu Phe Ala Phe Asn Gin Gly 
290 295 300 

Glu Val Cys Thr Cys Pro Ser Arg Ala Leu Val Gin Glu Ser He Tyr 
20 305 310 315 320 

Glu Arg Phe Met Glu Arg Ala He Arg Arg Val Glu Ser He Arg Ser 

325 330 335 

Gly Asn Pro Leu Asp Ser Val Thr Gin Met Gly Ala Gin Val Ser His 
340 345 350 
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Gly Gin Leu Glu Thr lie Leu Asn Tyr lie Asp lie Gly Lys Lys Glu 
355 360 365 



Gly Ala Asp Val Leu Thr Gly Gly Arg Arg Lys Leu Leu Glu Gly Glu 
370 375 380 

5 Leu Lys Asp Gly Tyr Tyr Leu Glu Pro Thr lie Leu Phe Gly Gin Asn 
385 390 395 400 

Asn Met Arg Val Phe Gin Glu Glu He Phe Gly Pro Val Leu Ala Val 
405 410 415 

Thr Thr Phe Lys Thr Met Glu Glu Ala Leu Glu Leu Ala Asn Asp Thr 
10 420 425 430 

Gin Tyr Gly Leu Gly Ala Gly Val Trp Ser Arg Asn Gly Asn Leu Ala 
435 440 445 

Tyr Lys Met Gly Arg Gly He Gin Ala Gly Arg Val Trp Thr Asn Cys 
450 455 460 

15 Tyr His Ala Tyr Pro Ala His Ala Ala Phe Gly Gly Tyr Lys Gin Ser 
465 470 475 480 

Gly He Gly Arg Glu Thr His Lys Met Met Leu Glu His Tyr Gin Gin 
485 490 495 

Thr Lys Cys Leu Leu Val Ser Tyr Ser Asp Lys Pro Leu Gly Leu Phe 
20 500 505 510 



<210> 9 
<211> 5267 
<212> DNA 

<213> Klebsiella pneumoniae 
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<220> 

<223> Location complement 300. .2153 
<220> 

<223> Location complement 2166.. 2591 
5 <220> 

<223> Locaton complement 2594.. 3034 
<220> 

<223> Location complement 2191.. 4858 





<400> 9 
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360 




gtggcgaccg 


cattgcgcgg 


cccttctgtt 


ccccgaatat 


tgccctgccc 


ggcgaccacg 


420 




ccatagtgcg 


acaaggcttc 


cgtgataagc 


tgcgggatct 


caaagtccag 


egatgagecg 


480 




cccaccagca 


ccacaaaggc 


gatatcgcga 


atggaaccgc 


egggtgagae 


ctggcgcagc 


540 




gcgcgcaggc 


agttggtgac 


aaacactttc 


tctttcgcct 


gccggcgcac 


gagacgaatt 


600 


20 


ttttccagcg 


ggctggcgtt 


atcgatcggc 


accagttcgc 


cctccttgat 


gtacaccact 


660 




ttggcgaaca 


ccgccgggct 


gagggcttcc 


cgaaagaact 


ccaccgcgcc 


attctcgtga 


720 




cgaatactga 


acaggctttc 


cactttggcc 


agcgggtatt 


tttttatege 


ttccgccagc 


780 




gaaagatcct 


cgaggcccag 


ctcggtttta 


atcaacaggc 


tgaccatatt 


ccccgccccg 


840 




gcgagatgga 


ccgccgttat 


ctgcccctcc 


gcgttgacga 


tcgccgcatc 


cgtcgagccg 


900 


25 


gcgccgaggt 


cgaggatcgc 


cagcggcgcc 


gcacagccgg 


gagtggttaa 


cgccccggcg 


960 




atggccatgt 


tggcctccac 


gccgcccacc 


accacctcgg 


tetgeagteg 


ggcgctcagt 


1020 




tcgcgggcga 


taacctgcat 


ttgcagacga 


tccgctttca 


ccatcgccgc 


catcccgacg 


1080 




gcattctcca 


tggcgcactc 


gccggccatc 


ccgccctgca 


ccttgcgcgg 


aataaacgta 


1140 




tccaccgcca 


gcagatcctg 


gatgtatatc 


gcgctcatct 


catggccggt 


cagggacgcc 


1200 


30 


attaccttgc 


gcacccgctc 


aagcatgccg 


ccggcgtggg 


tgcccggttc 


gecgeggatg 


1260 




tcgcgtaccg 


gagcgcaggc 


gctcatcgcc 


tgcatgatgg 


cttccgcgcc 


ctcggcgaca 


1320 




tcggcctctc 


cgcggcgctt 


ttcgccgcta 


atgtagaggt 


tgcccgccgg 


gatcacccgc 


1380 




gactgcacat 


ccccctgcgg 


ggtcttgagc 


accaccgcgg 


aacggttgcc 


aatcagggcg 


1440 
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cgggcgatgg ggacgatggc ctgggtctct 
ccgtagggat tcgacaggat ccgcaccacc 
attaccccct cggggacctg ctccagcagc 
cgcaggcggt tgttcaccag cacgccgtcg 
5 atcccccggt cgagcgcctc attgagccac 
tcaatcagta cgatccaccc ctcggcatac 
agggcgatag tcgtccccac gccaacgccc 
atcatggtcg attcggtgat aatggtctcg 
accggcgcgg cttcgttaag atagatgcga 
10 gccagggcct gctccagcgc ggcgagggtc 
cccgtcgtcg cgacgatccc gctggcaaca 
agcgccacct cggtggtggc gttgccgata 
tccgcttagc ttcctttacg cagcttatgc 
acaaaggcgg cattcactgt cgcatgccag 

15 agctccgcct gcgaggagcg gaacgggcgc 
tcaggaatgg cgataagctc cgccgcgcgg 
atctgctcgg caatctgcgc ctggtactca 
gggcccacct cgccagagag caccttctcg 
ggcgtcagga tatgctccgg gcagcgggtg 

20 ttctcgctca tggtcactcc cttactaagt 
gcaccacatg tttggtctct ttgatatgaa 
gcaccatctg atcgttcacc accggcaccg 
cagcgttttt gccaatctgc cggtaggtct 
gctccaggtt gctgagcggc agcagatcgc 

25 ggataccgat gccgatcccc gagccgctca 
cgtcggacgt gcgcagaatg cgcaccaccc 
caatcagctc tttgaggatc gcgccatggg 
tatcgaaggc agggccgacg ccgatcacca 
ccccgccctc gcgggttttc agggtaaaag 

30 gaataccgcc ttattcaatg gtgtcgggct 
cccagcgttc ggcagagatg cgatagccgg 
tgaccgcact caccacctcg aactgccgat 
cggtgacccg ctggcgcagc atattgagaa 
ggctcagcgc gccgacaata tcgaggccgg 

35 cactcagatc ctccaccacg ttacgcggcg 
cggcctccac ctcctcgtcg gcgattggcg 
tcgcccgcgc cgctttctgg cgaatggcaa 



tccgggctta gcccgaagaa ggtggcgatc 1500 
tggcccggcg cggccacttc caccgccgcc 1560 
gtcacttcat ccaccaccgg cagggtttta 1620 
tcctttttga ggatcgccgc caccacgttg 1680 
cacacggcgt caaggaaatc gacggcgtcg 1740 
tgcgccgccg gcagcgtcgc cagccgcccg 1800 
accccgcccg gcgtctgcgg gttatgaccg 1860 
gtgatggtct ccatcgccac atcgccaatc 1920 
gagacatcgc tcatcgacca cggtgttttc 1980 
ccggcgatat tgtcccgcgt ccctttcatg 2040 
aacgccctcg cctgcgggta gtcggacgcc 2100 
tcaatcccgg ctattaacgg catgctgacc 2160 
cgctgctgat acacttccgc cgactcccgg 2220 
gtgtgctcca gctcgtcggc gatcgccagc 2280 
agcgcgttat agatagccag aatgcgctcg 2340 
cggaaattgc gcgccaccgc atggcgctgc 2400 
agggtctggc gggagatccg cacatcctgc 2460 
agggtaatat cggtcaatgg tttgccggta 2520 
gctaacggat aatcctgcac gcgcatggtt 2580 
cgatgtgcag ggtgacgggc tcggcgtcct 2640 
atagcgcggc tttggccata aatttcggcc 2700 
gcgaaggtga ctctttgcgc gcatagcgcg 2760 
ccagcgtcag cagcggcgcc tgggagaaca 2820 
gctgatggat gaccgtggtc cccttcgact 2880 
ggttggccgc atcccaggcc ataaaggaga 2 940 
gggcgtgaag cccctcttct tccaccccgg 3000 
gcatatcgat cagagtgtga tgctggtgtt 3060 
cttcatcggc gcgttcatcg gcagaagcta 3120 
agggctgaat ttgggttgtc tgttgcacag 3180 
gaaccacgcc cggaatattt ttgatctccg 3240 
tgcccggccc ctgatagtca ttgatgtcgt 33 00 
cgagaatggc cgaggtctgc aggtaatcgc 3360 
tattgctggc gatatcctca aagccgctgc 3420 
tgatgttgcg cttcatcatc tcttccaccg 3480 
gcatctcgtt gctgccgtgc gcgtaggtgg 3540 
gcagccccag ctcgcggaaa accgcctgga 3600 
tggtttccgc ctcggtcacc ggacgcaggc 3660 



-29- 



id /2387B 



cgccgtcaac catcaggtca cgctgcagga 
agttcgagcc ggcgaacatg ttgtcgtagt 
agtcggtgcc cggcagcatc tgcatcaggg 
agaaagtctg gtcgttggcg gacgccactt 
5 ccgccagcac cgcccgaatg cccgacggca 
cgccgttttg cagtccctga accccggcgc 
ggtagagcat cgacttgctc tccgaatagc 
tgtagcgcat tttcaacccg cgggaggcgt 
gcgtatcatc gccgtcggta aataccgctt 

10 agctggttaa gccacgcatg cccagctcca 
tcaacacgcc ggggcggccg cactgcgaac 
cgtagcgcgc gataccgacc gtggtctcct 
cggcgtcagc ggcaatctgc accggattat 
agggggtccg gcgggcacgc atcttctgca 

15 gcgccatcac ctcgaccgct ttggccggcg 
cccggctgac gtgaatatcc accagcatac 
cctgctctgt gcgctcaacg ttgatcgcgt 
actggtcccg gcgtttgccg tccagttcga 
agaccgggtc aaaggggctg tccatggcga 

20 gcccgtcctg attgacgggg cgctgggcca 
ttcatcggct caaaaggtga aatccgcaga 
gccgcccggc cattaccggc aatagcggaa 
atttaatttc gtttcaattt ggcacacgaa 
ctccggcggc aaaaataaaa aatgtgatcg 

25 acgcccttaa ttacgttttt ccgacgctat 
gacaaaataa cgctgacaaa aataaagcaa 
cgccccctac tgactattcg cgccagcgtt 



tgttgtaatc atcaaaatct tccgcatcga 3720 
tcggcaccgc gctgtagccg gagaaaataa 3780 
tgcgcgcggt gcggcgaata tccgagtggg 3840 
cgaggtcgag catagaggcg atcaggtttt 3900 
cagcgccggt catgccgata cagctcaccg 3960 
ctttagtaat gaagatgcag cgcgattcga 4020 
ccatcagcgc ttcggatccg gtgccggagg 4080 
aggccgaggc gaggaacgcc tttgaccacg 4140 
cggtgccgta gaccgacacc gtctcggcgt 4200 
gctcggtggc ctcttccacc gagcactgcg 4260 
cgaccaacag cgccagggcg ttaaacggcg 4320 
gttctgagaa gccgcggatc ccggcctcgg 4380 
ctttgagatt ggtgacgtgg cactggttgg 4440 
gcgccatcat catctccacc acgttcatct 4500 
tgatggcggt agtgatggca atgatctcct 4560 
gggctatttc caccgcctcc aggcgcattg 4620 
aatcggcgat aaatcggtcg atcatgtcaa 4680 
cgatcagacc gttgtccact tttactgaag 4740 
tcagcccctc ttcaggccac tcgccaatca 4800 
gtactgcaaa tcgttttgat cttttcattg 4860 
cggtagcgaa tacgccgggc cagcgtcgtt 4 920 
ctttaaatga gccagtggtg aaaaaaataa 4980 
atctaccgac agtttcacta tgaaacttta 5040 
cccgcaatga tataaatcaa ttaataaaaa 5100 
tttaacccta ttgactaaat catggcgggc 5160 
gccaaccgaa tggtaatagt tttttactat 5220 
atcctggtgc gggagaga 5268 



<210> 10 

<211> 607 

30 <212> PRT 

<213> Klebsiella pneumoniae 

<400> 10 

Met Pro Leu lie Ala Gly lie Asp lie Gly Asn Ala Thr Thr Glu Val 
15 10 15 
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Ala Leu Ala Ser Asp Tyr Pro Gin Ala Arg Ala Phe Val Ala Ser Gly 
20 25 30 

He Val Ala Thr Thr Gly Met Lys Gly Thr Arg Asp Asn He Ala Gly 
35 40 45 

5 Thr Leu Ala Ala Leu Glu Gin Ala Leu Ala Lys Thr Pro Trp Ser Met 
50 55 eo 

Ser Asp Val Ser Arg He Tyr Leu Asn Glu Ala Ala Pro Val He Gly 
65 70 75 80 

Asp Val Ala Met Glu Thr He Thr Glu Thr He He Thr Glu Ser Thr 
10 85 90 95 

Met He Gly His Asn Pro Gin Thr Pro Gly Gly Val Gly Val Gly Val 
100 105 1X0 

Gly Thr Thr He Ala Leu Gly Arg Leu Ala Thr Leu Pro Ala Ala Gin 
115 120 125 

15 Tyr Ala Glu Gly Trp He Val Leu He Asp Asp Ala Val Asp Phe Leu 
130 135 140 

Asp Ala Val Trp Trp Leu Asn Glu Ala Leu Asp Arg Gly He Asn Val 
145 150 155 160 

Val Ala Ala He Leu Lys Lys Asp Asp Gly Val Leu Val Asn Asn Arg 
20 165 170 175 

Leu Arg Lys Thr Leu Pro Val Val Asp Glu Val Thr Leu Leu Glu Gin 
180 185 190 

Val Pro Glu Gly Val Met Ala Ala Val Glu Val Ala Ala Pro Gly Gin 
195 200 205 
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Val Val Arg lie Leu Ser Asn Pro Tyr Gly lie Ala Thr Phe Phe Gly 
210 215 220 

Leu Ser Pro Glu Glu Thr Gin Ala He Val Pro He Ala Arg Ala Leu 
225 230 235 240 

5 He Gly Asn Arg Ser Ala Val Val Leu Lys Thr Pro Gin Gly Asp Val 

245 250 255 

Gin Ser Arg Val He Pro Ala Gly Asn Leu Tyr He Ser Gly Glu Lys 
260 265 270 

Arg Arg Gly Glu Ala Asp Val Ala Glu Gly Ala Glu Ala He Met Gin 
10 275 280 285 

Ala Met Ser Ala Cys Ala Pro Val Arg Asp He Arg Gly Glu Pro Gly 
290 295 300 

Thr His Ala Gly Gly Met Leu Glu Arg Val Arg Lys Val Met Ala Ser 
305 310 315 320 

15 Leu Thr Gly His Glu Met Ser Ala He Tyr He Gin Asp Leu Leu Ala 

325 330 335 

Val Asp Thr Phe He Pro Arg Lys Val Gin Gly Gly Met Ala Gly Glu 
340 345 350 

Cys Ala Met Glu Asn Ala Val Gly Met Ala Ala Met Val Lys Ala Asp 
20 355 360 365 

Arg Leu Gin Met Gin Val He Ala Arg Glu Leu Ser Ala Arg Leu Gin 
370 375 380 

Thr Glu Val Val Val Gly Gly Val Glu Ala Asn Met Ala He Ala Gly 
385 390 395 400 
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Ala Leu Thr Thr Pro Gly Cys Ala Ala Pro Leu Ala lie Leu Asp Leu 
405 410 415 

Gly Ala Gly Ser Thr Asp Ala Ala lie Val Asn Ala Glu Gly Gin lie 
420 425 430 

5 Thr Ala Val His Leu Ala Gly Ala Gly Asn Met Val Ser Leu Leu He 
435 440 445 

Lys Thr Glu Leu Gly Leu Glu Asp Leu Ser Leu Ala Glu Ala He Lys 
450 455 460 

Lys Tyr Pro Leu Ala Lys Val Glu Ser Leu Phe Ser He Arg His Glu 
10 465 470 475 480 

Asn Gly Ala Val Glu Phe Phe Arg Glu Ala Leu Ser Pro Ala Val Phe 
485 490 495 

Ala Lys Val Val Tyr lie Lys Glu Gly Glu Leu Val Pro He Asp Asn 
500 505 510 

15 Ala Ser Pro Leu Glu Lys He Arg Leu Val Arg Arg Gin Ala Lys Glu 
515 520 525 

Lys Val Phe Val Thr Asn Cys Leu Arg Ala Leu Arg Gin Val Ser Pro 
530 535 540 

Gly Gly Ser He Arg Asp He Ala Phe Val Val Leu Val Gly Gly Ser 
20 545 550 555 560 

Ser Leu Asp Phe Glu He Pro Gin Leu He Thr Glu Ala Leu Ser His 
565 570 575 

Tyr Gly Val Val Ala Gly Gin Gly Asn He Arg Gly Thr Glu Gly Pro 
580 585 590 
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Arg Asn Ala Val Ala Thr Gly Leu Leu Leu Ala Gly Gin Ala Asn 
595 600 605 



<210> 11 
<211> 141 
5 <212> PRT 

<213> Klebsiella pneumoniae 

<400> 11 

Met Ser Glu Lys Thr Met Arg Val Gin Asp Tyr Pro Leu Ala Thr Arg 
15 10 15 

10 CyB Pro Glu His lie Leu Thr Pro Thr Gly Lys Pro Leu Thr Asp lie 
20 25 30 

Thr Leu Glu Lys Val Leu Ser Gly Glu Val Gly Pro Gin Asp Val Arg 
35 40 45 

lie Ser Arg Gin Thr Leu Glu Tyr Gin Ala Gin lie Ala Glu Gin Met 
15 50 55 60 

Gin Arg His Ala Val Ala Arg Asn Phe Arg Arg Ala Ala Glu Leu He 
65 70 75 80 

Ala He Pro Asp Glu Arg He Leu Ala He Tyr Asn Ala Leu Arg Pro 
85 90 95 

20 Phe Arg Ser Ser Gin Ala Glu Leu Leu Ala He Ala Asp Glu Leu Glu 
100 105 110 

His Thr Trp His Ala Thr Val Asn Ala Ala Phe Val Arg Glu Ser Ala 
115 120 125 

Glu Val Tyr Gin Gin Arg His Lys Leu Arg Lys Gly Ser 
25 130 135 140 
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<210> 12 
<211> 146 
<212> PRT 

<213> Klebsiella pneumoniae 
5 <400> 12 

Met Pro His Gly Ala lie Leu Lys Glu Leu lie Ala Gly Val Glu Glu 
15 10 15 

Glu Gly Leu His Ala Arg Val Val Arg lie Leu Arg Thr Ser Asp Val 
20 25 30 

10 Ser Phe Met Ala Trp Asp Ala Ala Asn Leu Ser Gly Ser Gly lie Gly 
35 40 45 

lie Gly lie Gin Ser Lys Gly Thr Thr Val lie His Gin Arg Asp Leu 
50 55 60 

Leu Pro Leu Ser Asn Leu Glu Leu Phe Ser Gin Ala Pro Leu Leu Thr 
15 65 70 75 80 

Leu Glu Thr Tyr Arg Gin lie Gly Lys Asn Ala Ala Arg Tyr Ala Arg 
85 90 95 

Lys Glu Ser Pro Ser Pro Val Pro Val Val Asn Asp Gin Met Val Arg 
100 105 110 

20 Pro Lys Phe Met Ala Lys Ala Ala Leu Phe His He Lys Glu Thr Lys 
115 120 125 

His Val Val Gin Asp Ala Glu Pro Val Thr Leu His He Asp Leu Val 
130 135 140 

Arg Glu 
25 145 
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<210> 13 
<211> 555 
<212> PRT 

<213> Klebsiella pneumoniae 
5 <400> 13 

Met Lys Arg Ser Lys Arg Phe Ala Val Leu Ala Gin Arg Pro Val Asn 
15 10 15 

Gin Asp Gly Leu lie Gly Glu Trp Pro Glu Glu Gly Leu lie Ala Met 
20 25 30 

10 Asp Ser Pro Phe Asp Pro Val Ser Ser Val Lys Val Asp Asn Gly Leu 
35 40 45 

lie Val Glu Leu Asp Gly Lys Arg Arg Asp Gin Phe Asp Met lie Asp 
50 55 60 

Arg Phe lie Ala Asp Tyr Ala lie Asn Val Glu Arg Thr Glu Gin Ala 
15 65 70 75 80 

Met Arg Leu Glu Ala Val Glu lie Ala Arg Met Leu Val Asp He His 
85 90 95 

Val Ser Arg Glu Glu He He Ala He Thr Thr Ala He Thr Pro Ala 
100 105 110 

20 Lys Ala Val Glu Val Met Ala Gin Met Asn Val Val Glu Met Met Met 
115 120 125 

Ala Leu Gin Lys Met Arg Ala Arg Arg Thr Pro Ser Asn Gin Cys His 
130 135 140 

Val Thr Asn Leu Lys Asp Asn Pro Val Gin He Ala Ala Asp Ala Ala 
25 145 150 155 160 
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Glu Ala Gly He Arg Gly Phe Ser Glu Gin Glu Thr Thr Val Gly He 
165 170 175 

Ala Arg Tyr Ala Pro Phe Asn Ala Leu Ala Leu Leu Val Gly Ser Gin 
180 185 190 

5 Cys Gly Arg Pro Gly Val Leu Thr Gin Cys Ser Val Glu Glu Ala Thr 
195 200 205 

Glu Leu Glu Leu Gly Met Arg Gly Leu Thr Ser Tyr Ala Glu Thr Val 
210 215 220 

Ser Val Tyr Gly Thr Glu Ala Val Phe Thr Asp Gly Asp Asp Thr Pro 
10 225 230 235 240 

Trp Ser Lys Ala Phe Leu Ala Ser Ala Tyr Ala Ser Arg Gly Leu Lys 
245 250 255 

Met Arg Tyr Thr Ser Gly Thr Gly Ser Glu Ala Leu Met Gly Tyr Ser 
260 265 270 

15 Glu Ser Lys Ser Met Leu Tyr Leu Glu Ser Arg Cys He Phe He Thr 
275 280 285 

Lys Gly Ala Gly Val Gin Gly Leu Gin Asn Gly Ala Val Ser Cys He 
290 295 300 

Gly Met Thr Gly Ala Val Pro Ser Gly He Arg Ala Val Leu Ala Glu 
20 305 310 315 320 

Asn Leu He Ala Ser Met Leu Asp Leu Glu Val Ala Ser Ala Asn Asp 
325 330 335 

Gin Thr Phe Ser His Ser Asp He Arg Arg Thr Ala Arg Thr Leu Met 
340 345 350 
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Gin Met Leu Pro Gly Thr Asp Phe lie Phe Ser Gly Tyr Ser Ala Val 
355 360 365 

Pro Asn Tyr Asp Asn Met Phe Ala Gly Ser Asn Phe Asp Ala Glu Asp 
370 375 380 

5 Phe Asp Asp Tyr Asn lie Leu Gin Arg Asp Leu Met Val Asp Gly Gly 
3Q 5 390 395 400 

Leu Arg Pro Val Thr Glu Ala Glu Thr He Ala He Arg Gin Lys Ala 
405 410 415 

Ala Arg Ala He Gin Ala Val Phe Arg Glu Leu Gly Leu Pro Pro He 
10 420 425 430 

Ala Asp Glu Glu Val Glu Ala Ala Thr Tyr Ala His Gly Ser Asn Glu 
435 440 445 

Met Pro Pro Arg Asn Val Val Glu Asp Leu Ser Ala Val Glu Glu Met 
450 455 460 

15 Met Lys Arg Asn He Thr Gly Leu Asp He Val Gly Ala Leu Ser Arg 
465 470 475 480 

Ser Gly Phe Glu Asp He Ala Ser Asn He Leu Asn Met Leu Arg Gin 
485 490 495 

Arg Val Thr Gly Asp Tyr Leu Gin Thr Ser Ala He Leu Asp Arg Gin 
20 500 505 510 

Phe Glu Val Val Ser Ala Val Asn Asp He Asn Asp Tyr Gin Gly Pro 
515 520 525 

Gly Thr Gly Tyr Arg He Ser Ala Glu Arg Trp Ala Glu He Lys Asn 
530 535 540 
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lie Pro Gly Val Val Gin Pro Asp Thr He Glu 
545 550 555 



<210> 14 

<211> 56 

5 <212> DNA 

<213> Escherichia coli 



<400> 14 

gctaccatgg cttaaccggt accaaggaga tatcatatgt cagtacccgt tcaaca 56 



<210> 15 
10 <211> 59 
<212> DNA 

<213> Escherichia coli 
<400> 15 

gcctcgagtc tagagccgtc gacgggaatt cgagctctta agactgtaaa taaaccacc 59 



15 <210> 16 
<211> 46 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 16 

20 gcggtaccaa ggaggtatca tatgttcagt agatctacgc tctgct 46 



<210> 17 
<211> 33 
<212> DNA 

<213> Saccharomyces cerevisiae 
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<400> 17 

gcgaattcga gctcttactc gtccaatttg gcac 34 



<210> 18 
<211> 35 
5 <212> DNA 

<213> Homo sapiens 

<400> 18 

gcggtaccaa ggaggtatca tatgtcagcc gccgc 35 



<210> 19 
10 <211> 39 
<212> DNA 
<213> Homo sapiens 

<400> 19 

gcgaattcga gctcttatga gttcttctga ggcactttg 



15 <210> 20 
<211> 44 
<212> DNA 

<213> Escherichia coli 
<400> 20 

20 gcggtaccaa ggaggtatca tatgaccaat aatccccctt cage 44 



<210> 21 
<211> 38 
<212> DNA 

<213> Escherichia coli 
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<400> 21 

gcgaattcga gctcttagaa cagccccaac ggtttatc 38 



<210> 22 
<211> 20 
5 <212> DNA 

<213> Escherichia coli 

<400> 22 

atcccgccgt taaccaccat 



<210> 23 
10 <211> 34 
<212> DNA 

<213> Escherichia coli 
<400> 23 

gcggtaccat tgttatccgc tcacaattcc acac 



15 QBMAD\223318 



-41- 



:C3i . 



«=a.. ii„tt( -ii..ti> 



US 00/23878 



CLAIM OR CLAIMS 

I/WE CLAIM: 

1 , A method for producing 3-hydroxypropionic acid comprising the steps of 
providing in a fermenter a recombinant microorganism which expresses genes 
5 for non-native enzymes which are capable of catalyzing the production of 3- 
hydroxypropionic acid from glycerol; 

providing a source of glycerol or glucose for the recombinant microorganism, 

and 

fermenting the microorganism under conditions which result in the accumulation 
10 of 3-hydroxypropionic acid. 



2. A method for producing 3-hydroxypropionic acid comprising the steps of 
providing in a fermenter a recombinant microorganism which carries genetic 
constructions for the expression of a glycerol dehydratase and an aldehyde 
dehydrogenase which are capable of catalyzing the production of 3-hydroxypropionic 
1 5 acid from glycerol; 

providing a source of glycerol or glucose for the recombinant microorganism, 

and 

fermenting the microorganism under conditions which result in the accumulation 
of 3-hydroxypropionic acid. 
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3. A method for producing 3-hydroxypropionic acid comprising the steps of 
providing in a fermenter a recombinant microorganism which carries a genetic 
construct which expresses the dhaB gene from Klebsiella pneumoniae and a gene for an 
aldehyde dehydrogenase, which are capable of catalyzing the production of 3- 
5 hydroxypropionic acid from glycerol; 

providing a source of glycerol or glucose for the recombinant microorganism, 

and 

fermenting the microorganism under conditions which result in the accumulation 
of 3-hydroxypropionic acid. 

10 4. The method of claim 3 wherein the gene for the aldehyde dehydrogenase is 

selected from the group consisting of ALDH4, ALD2, aldA and aldB. 

5. The method of claim 3 wherein the aldehyde dehydrogenase is selected from 
the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. 

6. A recombinant E. coli host comprising in its inheritable genetic materials 
1 5 foreign genes encoding a glycerol dehydratase and an aldehyde dehydrogenase, such 

that the host is capable of producing 3-hydroxypropionic acid from glycerol. 

7. A recombinant E. coli host comprising in its inheritable genetic materials the 
dhaB gene from Klebsiella pheumoniae and the ald4 gene from Saccharomycetes 
cervisiae y such that the host is capable of producing 3-hydroxypropionic from glycerol. 
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8. A bacterial host comprising in its inheritable genetic material a genetic 
construction encoding for the expression of a glycerol dehydratase enzyme and an 
aldehyde dehydrogenase enzyme, such that the bacterial host is capable of converting 
glycerol to 3-hydroxypropionic acid. 

5 9. The bacterial host of claim 8 wherein the glycerol dehydratase from 

Klebsiella pneumoniae, 

10. The bacterial host of claim 8 wherein the gene encoding the glycerol 
dehydratase is the dhaB gene from Klebsiella pneumoniae. 

1 1 . The bactciial host of claim 8 wherein the aldehyde dehydrogenase is 

10 selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 and 
SEQIDNO:8. 

12. The bacterial host of claim 8 wherein the gene for the aldehyde 
dehydrogenase is selected from the group consisting of ALDH4, ALD2, aldA and aldB. 
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ABSTRACT OF THE DISCLOSURE 
The production of 3-hydroxypropionic acid (3-HP) from glycerol in a bacterial 
host is described. 3-HP is a useful feedstock for the production of polymeric materials. 
The genetic engineering of a bacterial host with two enzymes is sufficient to enable 
production of 3-HP. One enzyme is a glycerol dehydratase and the other is an aldehyde 
dehydrogenase. 

QBMAD\223318 
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